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METHODS, COMPOSITIONS AND KITS 
FOR THE DETECTION AND MONITORING OF BREAST CANCER 

GOVERNMENTAL SUPPORT 
5 This work was supported in part by Grants CA-75794 and CA-80518 from 

the National Cancer Institute. The government may have certain rights in the invention. 

REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part of U.S. Application No. 
10 09/825,301, filed April 2, 2001, which is incorporated herein by reference in its entirety. 

TECHNICAL FIELD OF THE INVENTION 

The present invention relates generally to the field of cancer diagnostics. 
More specifically, the present invention relates to methods, compositions and kits for the 
detection of cancer that employ oligonucleotide hybridization and/or amplification to 
15 simultaneously detect two or more tissue-specific polynucleotides in a biological sample 
suspected of containing cancer cells. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

Cancer remains a significant health problem throughout the world. The 
20 failure of conventional cancer treatment regimens can commonly be attributed, in part, to 

delayed disease diagnosis. Although significant advances have been made in the area of 

cancer diagnosis, there still remains a need for improved detection methodologies that 

permit early, reliable and sensitive determination of the presence of cancer cells. 

Description of the Related Art 
25 Breast cancer is second only to lung cancer in mortality among women in 

the U.S., affecting more than 180,000 women each year and resulting in approximately 
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40,000-50,000 deaths annually. For women in North America, the life-time odds of getting 
breast cancer are one in eight. 

Management of the disease currently relies on a combination of early 
diagnosis (through routine breast screening procedures) and aggressive treatment, which 
5 may include one or more of a variety of treatments such as surgery, radiotherapy, 
chemotherapy and hormone therapy. The course of treatment for a particular breast cancer 
is often selected based on a variety of prognostic parameters, including analysis of specific 
tumor markers. See, e.g., Porter-Jordan et al., Breast Cancer 8:73-100 (1994). The use of 
established markers often leads, however, to a result that is difficult to interpret; and the 

10 high mortality observed in breast cancer patients indicates that improvements are needed in 
the diagnosis of the disease. 

The recent introduction of immunotherapeutic approaches to breast cancer 
treatment which are targeted to Her2/neu have provided significant motivation to identify 
additional breast cancer specific genes as targets for therapeutic antibodies and T-cell 

15 vaccines as well as for diagnosis of the disease. To this end, mammaglobin, has been 
identified as one of the most breast-specific genes discovered to date, being expressed in 
approximately 70-80% of breast cancers. Because of its highly tissue-specific distribution, 
detection of mammaglobin gene expression has been used to identify micrometastatic 
lesions in lymph node tissues and, more recently, to detect circulating breast cancer cells in 

20 peripheral blood of breast cancer patients with known primary and metastatic lesions. 

Mammaglobin is a homologue of a rabbit uteroglobin and the rat steroid 
binding protein subunit C3 and is a low molecular weight protein that is highly 
glycosylated. Watson et al, Cancer Res, 56:860-5 (1996); Watson et al., Cancer Res. 
59:3028-3031 (1999); Watson et al, Oncogene 16:817-24 (1998). In contrast to its 

25 homologs, mammaglobin has been reported to be breast specific and overexpression has 
been described in breast tumor biopsies (23%), primary and metastatic breast tumors 
(-75%) with reports of the detection of mammaglobin mRNA expression in 91% of lymph 
nodes firom metastatic breast cancer patients. Leygue et al., J. Pathol 189:28-33 (1999) 
and Min et al. Cancer Res, 58:4581-4584 (1998). 
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Since mammaglobin gene expression is not a universal feature of breast 
cancer, the detection of this gene alone may be insufficient to permit the reliable detection 
of all breast cancers. Accordingly, what is needed in the art is a methodology that employs 
the detection of two or more breast cancer specific genes in order to improve the sensitivity 
5 and reliability of detection of micrometastases, for example, in lymph nodes and bone 
marrow and/or for recognition of anchorage-independent cells in the peripheral circulation. 

The present invention achieves these and other related objectives by 
providing methods that are useful for the identification of tissue-specific polynucleotides, 
in particular tumor-specific polynucleotides, as well as methods, compositions and kits for 
1 0 the detection and monitoring of cancer cells in a patient afflicted with the disease. 



SUMMARY OF THE INVENTION 

By certain embodiments, the present invention provides 
methods for identifying one or more tissue-specific polynucleotides which methods 
comprise the steps of: (a) performing a genetic subtraction to identify a pool of 

15 polynucleotides firom a tissue of interest; (b) performing a DNA microarray analysis to 
identify a first subset of said pool of polynucleotides of interest wherein each member 
polynucleotide of said first subset is at least two-fold over-expressed in said tissue of 
interest as compared to a control tissue; and (c ) performing a quantitative polymerase 
chain reaction analysis on polynucleotides within said first subset to identify a second 

20 subset of polynucleotides that are at least two-fold over-expressed as compared to the 
control tissue. Preferred genetic subtractions are selected from the group consisting of 
differential display and cDNA subtraction and are described in further detail herein below. 

Altemate embodiments of the present invention provide methods of 
identifying a subset of polynucleotides showing concordant and/or complementary tissue- 

25 specific expression profiles in a tissue of interest. Such methods comprise the steps of, (a) 
performing an expression analysis selected from the group consisting of DNA microarray 
and quantitative PGR to identify a first polynucleotides that is at least two-fold over- 
expressed in a tissue of interest as compared to a control tissue; and (b) performing an 
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expression analysis selected from the group consisting of DNA microarray and quantitative 
PGR to identify a first polynucleotides that is at least two-fold over-expressed in a tissue of 
interest as compared to a control tissue. 

Further embodiments of the present invention provide methods for detecting 
5 the presence of a cancer cell in a patient. Such methods comprise the steps of: (a) 
obtaining a biological sample fi-om the patient; (b) contacting the biological sample with a 
first oligonucleotide pair wherein the members of the first oligonucleotide pair hybridize, 
under moderately stringent conditions, to a first polynucleotide and the complement 
thereof, respectively; (c) contacting the biological sample with a second oligonucleotide 

10 pair wherein the members of the second oligonucleotide pair hybridize, under moderately 
stringent conditions, to a second polynucleotide and the complement thereof, respectively 
and wherein the first polynucleotide is unrelated in nucleotide sequence to the second 
polynucleotide; (d) amplifying the first polynucleotide and the second polynucleotide; and 
(e) detecting the amplified first polynucleotide and the amplified second polynucleotide; 

15 wherein the presence of the amplified first polynucleotide or amplified second 
polynucleotide indicates the presence of a cancer cell in the patient. 

By some embodiments, detection of the amplified first and/or second 
polynucleotides may be preceded by a fractionation step such as, for example, gel 
electrophoresis. Alternatively or additionally, detection of the amplified first and/or second 

20 polynucleotides may be achieved by hybridization of a labeled oligonucleotide probe that 
hybridizes specifically, under moderately stringent conditions, to the first or second 
polynucleotide. Oligonucleotide labeling may be achieved by incorporating a radiolabeled 
nucleotide or by incorporating a fluorescent label. 

In certain preferred embodiments, cells of a specific tissue type may be 

25 enriched from the biological sample prior to the steps of detection. Enrichment may be 
achieved by a methodology selected from the group consisting of cell capture and cell 
depletion. Exemplary cell capture methods include knmunocapture and comprise the steps 
of: (a) adsorbing an antibody to a tissue-specific cell surface to cells said biological 
sample; (b) separating the antibody adsorbed tissue-specific cells from the remainder of the 
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biological sample. Exemplary cell depletion may be achieved by cross-linking red cells 
and white cells followed by a subsequent fractionation step to remove the cross-linked 
cells. 

Altemative embodiments of the present invention provide methods for 
5 determining the presence or absence of a cancer in a patient, comprising the steps of: (a) 
contacting a biological sample obtained from the patient with an oligonucleotide that 
hybridizes to a polynucleotide that encodes a breast tumor protein; (b) detecting in the 
sample a level of a polynucleotide (such as, for example, mRNA) that hybridizes to the 
oligonucleotide; and (c) comparing the level of polynucleotide that hybridizes to the 

10 oligonucleotide with a predetermined cut-off value, and therefrom determining the 
presence or absence of a cancer in the patient. Within certain embodiments, the amount of 
mRNA is detected via polymerase chain reaction using, for example, at least one 
oligonucleotide primer that hybridizes to a polynucleotide encoding a polypeptide as 
recited above, or a complement of such a polynucleotide. Within other embodiments, the 

15 amount of mRNA is detected using a hybridization technique, employing an 
oligonucleotide probe that hybridizes to a polynucleotide that encodes a polypeptide as 
recited above, or a complement of such a polynucleotide. 

In related aspects, methods are provided for monitoring the progression of a 
cancer in a patient, comprising the steps of: (a) contacting a biological sample obtained 

20 from a patient with an oligonucleotide that hybridizes to a polynucleotide that encodes a 
breast tumor protein; (b) detecting in the sample an amount of a polynucleotide that 
hybridizes to the oligonucleotide; (c) repeating steps (a) and (b) using a biological sample 
obtained from the patient at a subsequent point in time; and (d) comparing the amount of 
polynucleotide detected in step (c) with the amount detected in step (b) and therefrom 

25 monitoring the progression of the cancer in the patient. 

Certain embodiments of the present invention provide that the step of 
amplifying said first polynucleotide and said second polynucleotide is achieved by the 
polymerase chain reaction (PGR). 
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Within certain embodiments, the cancer cell to be detected may be selected 
from the group consisting of prostate cancer, breast cancer, colon cancer, ovarian cancer, 
lung cancer head & neck cancer, lymphoma, leukemia, melanoma, liver cancer, gastric 
cancer, kidney cancer, bladder cancer, pancreatic cancer and endometrial cancer. Still 
5 further embodiments of the present invention provide that the biological sample is selected 
from the group consisting of blood, a lymph node and bone marrow. The lymph node may 
be a sentinel lymph node. 

Within specific embodiments of present invention it is provided that the first 
polynucleotide is selected from the group consisting of mammaglobin, lipophilin B, 
10 GABAtc (B899P), B726P, B511S, B533S, B305D and B311D. Other embodiments 
provide that the second polynucleotide is selected from the group consisting of 
mammaglobin, lipophilin B, GABAtt (B899P), B726P, B511S, B533S, B305D and 
B311D. 

Altemate embodiments of the present invention provide methods for 
15 detecting the presence or absence of a cancer in a patient, comprising the steps of: (a) 
contacting a biological sample obtained from a patient with a first oligonucleotide that 
hybridizes to a polynucleotide selected from the group consisting of mammaglobin and 
lipophilin B; (b) contacting the biological sample with a second oligonucleotide that 
hybridizes to a polynucleotide sequence selected from the group consisting of GABAti 
20 (B899P), B726P, B511S, B533S, B305D and B311D; (c) detecting in the sample an 
amount of a polynucleotide that hybridizes to at least one of the oligonucleotides; and (d) 
comparing the amount of polynucleotide that hybridizes to the oligonucleotide to a 
predetermined cut-off value, and therefrom determining the presence or absence of a 
cancer in the patient. 

25 According to certain embodiments, oligonucleotides may be selected from 

those disclosed herein such as those presented in SEQ ID NO:33-72. By other 
embodiments, the amount of polynucleotide that hybridizes to the oligonucleotide is 
determined using a polymerase chain reaction. Altematively, the amount of polynucleotide 
that hybridizes to the oligonucleotide may be determined using a hybridization assay. 
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Still other embodiments of the present invention provide methods for 
determining the presence or absence of a cancer cell in a patient, comprising the steps of: 
(a) contacting a biological sample obtained from a patient with a first oligonucleotide that 
hybridizes to a polynucleotide selected firom the group consisting of a polynucleotide 
5 depicted in SEQ ID NO:73 and SEQ ID NO:74 or complement thereof; (b) contacting the 
biological sample with a second oligonucleotide that hybridizes to a polynucleotide 
depicted in SEQ ID NO:75 or complement thereof; (c) contacting the biological sample 
with a third oligonucleotide that hybridizes to a polynucleotide selected from the group 
consisting of a polynucleotide depicted in SEQ ID N0:1, SEQ ID N0:3, SEQ ID N0:5, 

10 SEQ ID N0:6 and SEQ ID N0:7 or complement thereof; (d) contacting the biological 
sample with a fourth oligonucleotide that hybridizes to a polynucleotide selected from the 
group consisting of a polynucleotide depicted in SEQ ID N0:1 1 or complement thereof; (e) 
contacting the biological sample with a fifth oligonucleotide that hybridizes to a 
polynucleotide selected from the group consisting of a polynucleotide depicted in SEQ ID 

15 N0:13, 15 and 17 or complement thereof; (f) contacting the biological sample with a sixth 
oligonucleotide that hybridizes to a polynucleotide selected from the group consisting of a 
polynucleotide depicted in SEQ ID N0:19, SEQ ID NO:20, SEQ ID N0:21, SEQ ID 
NO:22, SEQ ID NO:23 and SEQ ID NO:24 or complement thereof; (g) contacting the 
biological sample with a seventh oligonucleotide that hybridizes to a polynucleotide 

20 depicted in SEQ ID NO:30 or complement thereof; (h) contacting the biological sample 
with an eighth oligonucleotide that hybridizes to a polynucleotide depicted in SEQ ID 
NO:32 or complement thereof; (i) contacting the biological sample with a ninth 
oligonucleotide that hybridizes to a polynucleotide depicted in SEQ ID NO: 76 or 
complement thereof; (j) detecting in the sample a hybridized oligonucleotide of any one of 

25 steps (a) through (i); and (j) comparing the amount of polynucleotide that hybridizes to the 
oligonucleotide to a predetermined cut-off value, wherein the presence of a hybridized 
oligonucleotide in any one of steps (a) through (i) in excess of the pre-determined cut-off 
value indicates the presence of a cancer cell in the biological sample of said patient. 
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Other related embodiments of the present invention provide methods for 
determining the presence or absence of a cancer cell in a patient, comprising the steps of: 
(a) contacting a biological sample obtained from a patient with a first oligonucleotide and a 
second oligonucleotide wherein said first and second oligonucleotides hybridize under 
5 moderately stringent conditions to a first and a second polynucleotide selected from the 
group selected from the group consisting of SEQ ID NO:73, SEQ ID NO:74, SEQ ID 
NO:75, SEQ ID N0:1, SEQ ID N0:3, SEQ ID N0:5, SEQ ID N0:6, SEQ ID N0:7, SEQ 
ID N0:11, SEQ ID N0:13, SEQ ID N0:15, SEQ ID N0:17, SEQ ID N0:19, SEQ ID 
NO:20, SEQ ID N0:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:30, 

10 SEQ ID NO:32, and SEQ ID NO:76 and wherein said first polynucleotide is unrelated 
structurally to said second polynucleotide; (b) detecting in the sample said first and said 
second hybridized oligonucleotides; and (c) comparing the amount of polynucleotide that 
hybridizes to the oligonucleotide to a predetermined cut-off value, wherein the presence of 
a hybridized first oligonucleotide or a hybridized second oligonucleotide in excess of the 

1 5 pre-determined cut-off value indicates the presence of a cancer cell in the biological sample 
of said patient. 

Other related embodiments of the present invention provide methods for 
determining the presence or absence of a cancer cell in a patient, comprising the steps of: 
(a) contacting a biological sample obtained from a patient with a first oligonucleotide and a 

20 second oligonucleotide wherein said first and second oligonucleotides hybridize under 
moderately stringent conditions to a first and a second polynucleotide are both tissue- 
specific polynucleotides of Ihe cancer to be detected and wherein said first polynucleotide 
is unrelated structurally to said second polynucleotide; (b) detecting in the sample said first 
and said second hybridized oligonucleotides; and (c) comparing the amount of 

25 polynucleotide that hybridizes to the oligonucleotide to a predetermined cut-off value, 
wherein the presence of a hybridized first oligonucleotide or a hybridized second 
oligonucleotide in excess of the pre-determined cut-off value indicates the presence of a 
cancer cell in the biological sample of said patient. 
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In other related aspects, the present invention further provides compositions 
useful in the methods disclosed herein. Exemplary compositions comprise two or more 
oligonucleotide primer pairs each one of which specifically hybridizes to a distinct 
polynucleotide. Exemplary oligonucleotide primers suitable for compositions of the 
5 present invention are disclosed herein by SEQ ID NO: 33-71. Exempli polynucleotides 
suitable for compositions of the present invention are disclosed in SEQ ID NO:73, SEQ ID 
NO:74, SEQ ID NO:75, SEQ ID N0:1, SEQ ID N0:3, SEQ ID N0:5, SEQ ID N0:6, SEQ 
ID N0:7, SEQ ID N0:11, SEQ ID N0:13, SEQ ID N0:15, SEQ ID N0:17, SEQ ID 
N0:19, SEQ ID NO:20, SEQ ID N0:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, 

10 SEQ ID NO:30, SEQ ID NO:32, and SEQ ID NO:76. 

The present invention also provides kits that are suitable for performing the 
detection methods of the present invention. Exemplary kits comprise oligonucleotide 
primer pairs each one of which specifically hybridizes to a distinct polynucleotide. Within 
certain embodiments, kits according to the present invention may also comprise a nucleic 

15 acid polymerase and suitable buffer. Exemplary oligonucleotide primers suitable for kits 
of the present invention are disclosed herein by SEQ ID NO: 33-71. Exemplary 
polynucleotides suitable for kits of the present invention are disclosed in SEQ ID NO:73, 
SEQ ID NO:74, SEQ ID NO:75, SEQ ID N0:1, SEQ ID N0:3, SEQ ID N0:5, SEQ ID 
N0:6, SEQ ID N0:7, SEQ ID N0:11, SEQ ID N0:13, SEQ ID N0:15, SEQ ID N0:17, 

20 SEQ ID N0:19, SEQ ID NO:20, SEQ ID N0:21, SEQ ID N0:22, SEQ ID NO:23, SEQ ID 
NO:24, SEQ ID NO:30, SEQ ID NO:32, and SEQ ID NO:76. 

These and other aspects of the present invention will become apparent upon 
reference to the following detailed description and attached drawings. All references 
disclosed herein are hereby incorporated by reference in their entirety as if each was 

25 incorporated individually. 
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BRIEF DESCRIPTION OF THE DRAWINGS AND SEQUENCE IDENTIFIERS 

Figure 1 shows the mRNA expression profiles for B3 1 ID, B533S 
B726P as determined using quantitative PCR (Taqman^^). Abbreviations: B.T.: Breast 
tumor; B.M.: Bone marrow; B.R.: Breast reduction. 
5 Figure 2 shows the relationship of B533S expression to pathological stage 

of tumor. Tissues from normal breast (8), benign breast disorders (3), and breast tumors 
stage I (5), stage II (6), stage III (7), stage IV (3) and metastases (1 lymph node and 3 
pleural effusions) were tested in real-time PCR. The data is expressed as the mean 
copies/ng (J-actin for each group tested and the line is the calculated trend line, 
10 Figures 3 A and 3B show the gene complementation of B305D C-form, 

B726P, GABAix and mammaglobin in metastases and primary tumors, respectively. The 
cut-off for each of the genes was 6.57, 1 .65, 4.58 and 3.56 copies/ng p-Actin based on the 
mean of the negative normal tissues plus 3 standard deviations. 

Figure 4 shows the full-length cDNA sequence for mammaglobin. 
1 5 Figure 5 shows the determined cDNA sequence of the open reading frame 

encoding a mammaglobin recombinant polypeptide expressed in E, coli. 

Figure 6 shows the full-length cDNA sequence for GABAtt. 
Figure 7 shows the mRNA expression levels for mammaglobin, GABAti, 
B305D (C form) and B726P in breast tumor and normal samples determined using real- 
20 time PCR and the SYBR detection system. Abbreviations: BT: Breast tumor; BR: Breast 
reduction; A. PBMC: Activated peripheral blood mononuclear cells; R. PBMC: resting 
PBMC; T. Gland: Thyroid gland; S. Cord: Spinal Cord; A. Gland: Adrenal gland; B. 
Marrow: Bone marrow; S. Muscle: Skeletal muscle. 

Figure 8 is a bar graph showing a comparison between the LipophilinB 
25 alone and the LipophilinB-B899P-B305D-C-B726 multiplex assays tested on a panel of 
breast tumor samples. Abbreviations: BT: Breast tumor; BR: Breast reduction; SCID: 
severe combined immunodeficiency. 

Figure 9 is a gel showing the xmique band length of four amplification 
products of tumor genes of interest (mammaglobin, B305D, B899P, B726P) tested in a 
30 multiplex Real-time PCR assay. 
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Figure 10 shows a comparison of a multiplex assay using intron-exon border 
spanning primers (bottom panel) and those using non-optimized primers (top panel), to 
detect breast cancer cells in a panel of lymph node tissues. 

SEQ ID NO: 1 is the determined cDNA sequence for a first splice variant of 
5 B305D isoform A. 

SEQ ID NO: 2 is the amino acid sequence encoded by the sequence of SEQ 

ID NO: 1. 

SEQ ID NO: 3 is the determined cDNA sequence for a second splice variant 
ofB305D isoform A. 

10 SEQ ID NO: 4 is the amino acid sequence encoded by the sequence of SEQ 

ID NO: 3. 

SEQ ID NO: 5-7 are the determined cDNA sequences for three splice 
variants of B305D isoform C, 

SEQ ID NO: 8-10 are the amino acid sequences encoded by the sequence of 
1 5 SEQ ID NO: 5-7, respectively. 

SEQ ID NO: 1 1 is the determined cDNA sequence for 33 1 ID. 

SEQ ID NO: 12 is the amino acid sequence encoded by the sequence of 
SEQ ID NO: 11. 

SEQ ID NO: 13 is the determined cDNA sequence of a first splice variant of 

20 B726P. 

SEQ ID NO: 14 is the amino acid sequence encoded by the sequence of 
SEQ ID NO: 13. 

SEQ ID NO: 15 is the determined cDNA sequence of a second splice 
variant of B726P. 

25 SEQ ID NO: 16 is the amino acid sequence encoded by the sequence of 

SEQ ID NO: 15. 

SEQ ID NO: 17 is the determined cDNA sequence of a third splice variant 

ofB726P. 

SEQ ID NO: 18 is the amino acid sequence encoded by the sequence of 
30 SEQ ID NO: 17. 
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SEQ ID NO: 19-24 are the deteraiined cDNA sequences of further splice 
variants of B726P. 

SEQ ID NO: 25-29 are the amino acid sequences encoded by SEQ ID NO: 
19-24, respectively. 

5 SEQ ID NO: 30 is the determined cDNA sequence for B5 1 1 S. 

SEQ ID NO: 31 is the amino acid sequence encoded by SEQ ID NO: 30. 

SEQ ID NO: 32 is the determined cDNA sequence for B533S. 

SEQ ID NO:33 is the DNA sequence of Lipophilin B forward primer, 

SEQ ID NO: 34 is the DNA sequence of Lipophilin B reverse primer. 
1 0 SEQ ID NO:35 is the DNA sequence of Lipophilin B probe. 

SEQ ID NO:36 is the DNA sequence of GABA (B899P) forward primer. 

SEQ ID NO:37 is the DNA sequence of GABA (B899P) reverse primer. 

SEQ ID NO:38 is the DNA sequence of GABA (B899P) probe. 

SEQ ID NO:39 is the DNA sequence of B305D (C form) forward primer. 
1 5 SEQ ID NO:40 is the DNA sequence of B305D (C form) reverse primer. 

SEQ ID N0:41 is the DNA sequence of B305D (C form) probe. 

SEQ ID NO:42 is the DNA sequence of B726P forward primer. 

SEQ ID NO:43 is the DNA sequence of B726P reverse primer. 

SEQ ID NO:44 is the DNA sequence of B726P probe. 
20 SEQ ID NO:45 is the DNA sequence of Actin forward primer. 

SEQ ID NO:46 is the DNA sequence of Actin reverse primer. 

SEQ ID NO:47 is the DNA sequence of Actin probe. 

SEQ ID NO:48 is the DNA sequence of Mammaglobin forward primer. 

SEQ ID NO:49 is the DNA sequence of Mammaglobin reverse primer. 
25 SEQ ID NO:50 is the DNA sequence of Mammaglobin probe. 

SEQ ID N0:51 is the DNA sequence of a second GABA (B899P) reverse 



primer. 



30 primer. 



SEQ ID NO:52 is the DNA sequence of a second B726P forward primer. 
SEQ ID NO:53 is the DNA sequence of a GABA B899P-INT forward 
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SEQ ID NO:54 is the DNA sequence of a GABA B899P-INT reverse 

primer. 

SEQ ID NO:55 is the DNA sequence of a GABA B899P-INT Taqman 

probe. 

5 SEQ ID NO:56 is the DNA sequence of a B305D-INT forward primer. 

SEQ ID NO:57 is the DNA sequence of a B305D-INT reverse primer. 

SEQ ID NO:58 is the DNA sequence of a B305D-INT Taqman probe. 

SEQ ID NO:59 is the DNA sequence of a B726-INT forward primer. 

SEQ ID NO:60 is the DNA sequence of a B726-INT reverse primer. 
1 0 SEQ ID N0:61 is the DNA sequence of a B726-INT Taqman probe. 

SEQ ID NO:62 is the DNA sequence of a GABA B899P Taqman probe. 

SEQ ID NO:63 is the DNA sequence of a B31 ID forward primer. 

SEQ ID NO:64 is the DNA sequence of a B3 1 ID reverse primer. 

SEQ ID NO:65 is the DNA sequence of a B311D Taqman probe. 
15 SEQ ID NO:66 is the DNA sequence of a B533S forward primer. 

SEQ ID NO:67 is the DNA sequence of a B533S reverse primer. 

SEQ ID NO:68 is the DNA sequence of a B533S Taqman probe. 

SEQ ID NO:69 is the DNA sequence of a B511 S forward primer. 

SEQ ID NO:70 is the DNA sequence of a B5 1 1 S reverse primer. 
20 SEQ ID N0:71 is the DNA sequence of a B511S Taqman probe. 

SEQ ID NO:72 is the DNA sequence of a GABAti reverse primer. 

SEQ ID NO:73 is the full-length cDNA sequence for mammaglobin. 

SEQ ID NO:74 is the determined cDNA sequence of the open reading frame 
encoding a mammaglobin recombinant polypeptide expressed in E, coli. 
25 SEQ ID NO:75 is the full-length cDNA sequence for GABAti. 

SEQ ID NO:76 is the full-length cDNA sequence for lipophilin B. 

SEQ ID NO:77 is the amino acid sequence encoded by the sequence of SEQ 

IDNO:76. 
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DETAILED DESCRIPTION OF THE INVENTION 

As noted above, the present invention is directed generally to methods that 
are suitable for the identification of tissue-specific polynucleotides as well as to methods, 
compositions and kits that are suitable for the diagnosis and monitoring of cancer. While 
5 certain exemplary methods, compositions and kits disclosed herein are directed to the 
identification, detection and monitoring of breast cancer, in particular breast cancer- 
specific polynucleotides, it will be understood by those of skill in the art that the present 
invention is generally applicable to the identification, detection and monitoring of a wide 
variety of cancers, and the associated over-expressed polynucleotides, including, for 
10 example, prostate cancer, breast cancer, colon cancer, ovarian cancer, lung cancer, head & 
neck cancer, lymphoma, leukemia, melanoma, liver cancer, gastric cancer, kidney cancer, 
bladder cancer, pancreatic cancer and endometrial cancer. Thus, it will be apparent that the 
present invention is not limited solely to the identification of breast cancer-specific 
polynucleotides or to the detection and monitoring of breast cancer. 

15 

Identification of Tissue-specific Polvnucleotides 

Certain embodiments of the present invention provide methods, 
compositions and kits for the detection of a cancer cell within a biological sample. These 
methods comprise the step of detecting one or more tissue-specific polynucleotide(s) fi*om 

20 a patient's biological sample the over-expression of which polynucleotides indicates the 
presence of a cancer cell within the patient's biological sample. Accordingly, the present 
invention also provides methods that are suitable for the identification of tissue-specific 
polynucleotides. As used herein, the phrases "tissue-specific polynucleotides" or "tumor- 
specific polynucleotides" are meant to include all polynucleotides that are at least two-fold 

25 over-expressed as compared to one or more control tissues. As discussed in further detail 
herein below, over-expression of a given polynucleotide may be assessed, for example, by 
microarray and/or quantitative real-time polymerase chain reaction (Real-time PCR™) 
methodologies. 
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Exemplary methods for detecting tissue-specific polynucleotides may 
comprise the steps of: (a) performing a genetic subtraction to identify a pool of 
polynucleotides from a tissue of interest; (b) performing a DNA microarray analysis to 
identify a first subset of said pool of polynucleotides of interest wherein each member 
5 polynucleotide of said first subset is at least two-fold over-expressed in said tissue of 
interest as compared to a control tissue; and (c) performing a quantitative polymerase chain 
reaction analysis on polynucleotides within said first subset to identify a second subset of 
polynucleotides that are at least two-fold over-expressed as compared to said control tissue. 
Polynucleotides Generally 

10 As used herein, the term "polynucleotide" refers generally to either DNA or 

RNA molecules. Polynucleotides may be naturally occurring as normally found in a 
biological sample such as blood, serum, lymph node, bone marrow, sputum, urine and 
tumor biopsy samples. Alternatively, polynucleotides may be derived synthetically by, for 
example, a nucleic acid polymerization reaction. As will be recognized by the skilled 

15 artisan, polynucleotides may be single-stranded (coding or antisense) or double-stranded, 
and may be DNA (genomic, cDNA or synthetic) or RNA molecules. RNA molecules 
include HnRNA molecules, which contain introns and correspond to a DNA molecule in a 
one-to-one manner, and mRNA molecules, which do not contain introns. Additional 
coding or non-coding sequences may, but need not, be present within a polynucleotide of 

20 the present invention, and a polynucleotide may, but need not, be linked to other molecules 
and/or support materials. 

Polynucleotides may comprise a native sequence (ie. an endogenous 
sequence that encodes a tumor protein, such as a breast tumor protein, or a portion thereof) 
or may comprise a variant, or a biological or antigenic functional equivalent of such a 

25 sequence. Polynucleotide variants may contain one or more substitutions, additions, 
deletions and/or insertions, as further described below. The term "variants" also 
encompasses homologous genes of xenogenic origin. 

When comparing polynucleotide or polypeptide sequences, two sequences 
are said to be "identical" if the sequence of nucleotides or amino acids in the two sequences 
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is the same when aligned for maximum correspondence, as described below. Comparisons 
between two sequences are typically performed by comparing the sequences over a 
comparison window to identify and compare local regions of sequence similarity. A 
"comparison window" as used herein, refers to a segment of at least about 20 contiguous 

5 positions, usually 30 to about 75, 40 to about 50, in which a sequence may be compared to 
a reference sequence of the same number of contiguous positions after the two sequences 
are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using the 
Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, Inc., 

10 Madison, WI), using default parameters. This program embodies several alignment 
schemes described in the following references: Dayhoff, M.O. (1978) A model of 
evolutionary change in proteins - Matrices for detecting distant relationships. In Dayhoff, 
M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical Research 
Foundation, Washington DC Vol 5, Suppl. 3, pp. 345-358; Hein J. (1990) Unified 

15 Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology vol. 183, 
Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) CABIOS 
5:151-153; Myers, E.W. and MuUer W. (1988) CABIOS ^:11-17; Robinson, E.D. (1971) 
Comb. Theor 77:105; Santou, N. Nes, M. (1987) Mol Biol EvoL 4:406-425; Sneath, 
P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and Practice of 

20 Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. and Lipman, D.J. 
(1 983) Proc, Natl Acad, Sci. USA 80:726-730. 

Altematively, optimal alignment of sequences for comparison may be 
conducted by the local identity algorithm of Smith and Waterman (1981) Add, APL Math 
2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. Mol, Biol. 

25 48:443, by the search for similarity methods of Pearson dnd Lipman (1988) Proc. Natl 
Acad Sci USA 85: 2444, by computerized implementations of these algorithms (GAP, 
BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), or by inspection. 
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One preferred example of algorithms that are suitable for determining 
percent sequence identity and sequence similarity are the BLAST and BLAST 2,0 
algorithms, which are described in Altschul et al (1977) Nucl Acids Res, 25:3389-3402 
and Altschul et al. (1990) J, Mol Biol 215:403-410, respectively. BLAST and BLAST 2.0 
5 can be used, for example with the parameters described herein, to determine percent 
sequence identity for the polynucleotides and polypeptides of the invention. Software for 
performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information. In one illustrative example, cumulative scores can be 
calculated using, for nucleotide sequences, the parameters M (reward score for a pair of 

10 matching residues; always >0) and N (penalty score for mismatching residues; always <0). 
For amino acid sequences, a scoring matrix can be used to calculate the cumulative score. 
Extension of the word hits in each direction are halted when: the cumulative alignment 
score falls off by the quantity X from its maximum achieved value; the cumulative score 
goes to zero or below, due to the accumulation of one or more negative-scoring residue 

15 alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, 
T and X determine the sensitivity and speed of the alignment. The BLASTN program (for 
nucleotide sequences) uses as defaults a wordlength (W) of 1 1, and expectation (E) of 10, 
and the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Natl Acad. 
Sol USA 89:10915) alignments, (B) of 50, expectation (E) of 10, M=5, N=-4 and a 

20 comparison of both strands. 

Preferably, the "percentage of sequence identity" is determined by 
comparing two optimally aligned sequences over a window of comp^ison of at least 20 
positions, wherein the portion of the polynucleotide or polypeptide sequence in the 
comparison window may comprise additions or deletions (z.^., gaps) of 20 percent or less, 

25 usually 5 to 15 percent, or 10 to 12 percent, as compared to the reference sequences (which 
does not comprise additions or deletions) for optimal alignment of the two sequences. The 
percentage is calculated by determining the number of positions at which the identical 
nucleic acid bases or amino acid residue occurs in both sequences to yield the number of 
matched positions, dividing the number of matched positions by the total number of 
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positions in the reference sequence (i.e., the window size) and multiplymg the results by 
100 to yield the percentage of sequence identity • 

Therefore, the present invention encompasses polynucleotide and 
polypeptide sequences having substantial identity to the sequences disclosed herein, for 

5 example those comprising at least 50% sequence identity, preferably at least 55%, 60%, 
65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or higher, sequence 
identity compared to a polynucleotide or polypeptide sequence of this invention using the 
methods described herein, (e.g., BLAST analysis using standard parameters, as described 
below). One skilled in this art vdll recognize that these values can be appropriately 

10 adjusted to determine corresponding identity of proteins encoded by two nucleotide 
sequences by taking into account codon degeneracy, amino acid similarity, reading frame 
positioning and the like. 

In additional embodiments, the present invention provides isolated 
polynucleotides and polypeptides comprising various lengths of contiguous stretches of 

15 sequence identical to or complementary to one or more of the sequences disclosed herein. 
For example, polynucleotides are provided by this invention that comprise at least about 
15, 20, 30, 40, 50, 75, 100, 150, 200, 300, 400, 500 or 1000 or more contiguous nucleotides 
of one or more of the sequences disclosed herein as well as all intermediate lengths there 
between. It v^U be readily understood that "intermediate lengths", in this context, means 

20 any length between the quoted values, such as 16, 17, 18, 19, etc.\ 21, 22, 23, etc,\ 30, 31, 
32, etc,\ 50, 51, 52, 53, etc.\ 100, 101, 102, 103, etc.\ 150, 151, 152, 153, etc.\ including all 
integers through 200-500; 500-1,000, and the like. 

The polynucleotides of the present invention, or fragments thereof, 
regardless of the length of the coding sequence itself, may be combined with other DNA 

25 sequences, such as promoters, polyadenylation signals, additional restriction enzyme sites, 
multiple cloning sites, other coding segments, and the like, such that their overall length 
may vary considerably. It is therefore contemplated that a nucleic acid fragment of almost 
any length may be employed, with the total length preferably being limited by the ease of 
preparation and use in the intended recombinant DNA protocol. For example, illustrative 
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DNA segments with total lengths of about 10,000, about 5000, about 3000, about 2,000, 
about 1,000, about 500, about 200, about 100, about 50 base pairs in length, and the like, 
(including all intermediate lengths) are contemplated to be useful in many implementations 
of this invention. 

5 In other embodiments, the present invention is directed to polynucleotides 

that are capable of hybridizing under moderately stringent conditions to a polynucleotide 
sequence provided herein, or a fragment thereof, or a complementary sequence thereof 
Hybridization techniques are well known in the art of molecular biology. For purposes of 
illustration, suitable moderately stringent conditions for testing the hybridization of a 

10 polynucleotide of this invention with other polynucleotides include prewashing in a 
solution of 5 X SSC, 0.5% SDS, 1,0 mM EDTA (pH 8.0); hybridizmg at 50°C-65^C, 5 X 
SSC, overnight; followed by washing twice at 65°C for 20 minutes with each of 2X, 0.5X 
and 0.2X SSC containing 0.1% SDS. 

Moreover, it will be appreciated by those of ordinary skill in the art that, as 

15 a resuh of the degeneracy of the genetic code, there are many nucleotide sequences that 
encode a polypeptide as described herein. Some of these polynucleotides bear minimal 
homology to the nucleotide sequence of any native gene. Nonetheless, polynucleotides that 
vary due to differences in codon usage are specifically contemplated by the present 
invention. Further, alleles of the genes comprising the polynucleotide sequences provided 

20 herein are within the scope of the present invention. Alleles are endogenous genes that are 
altered as a result of one or more mutations, such as deletions, additions and/or 
substitutions of nucleotides. The resulting mRNA and protein may, but need not, have an 
altered structure or function. Alleles may be identified using standard techniques (such as 
hybridization, amplification and/or database sequence comparison), 

25 Microarrav Analyses 

Polynucleotides that are suitable for detection according to the methods of 
the present invention may be identified, as described in more detail below, by screening a 
microarray of cDNAs for tissue and/or tumor-associated expression {e.g., expression that is 
at least two-fold greater in a tumor than in normal tissue, as determined using a 
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representative assay provided herein). Such screens may be performed, for example, using 
a Synteni microarray (Palo Alto, CA) according to the manufacturer's instructions (and 
essentially as described by Schena et al, Proc. Natl Acad Sci USA 93:10614-10619 
(1996) and Heller etaL, Proa Natl Acad Set USA 94:2150-2155 (1997)). 
5 Microarray is an effective method for evaluating large numbers of genes but 

due to its limited sensitivity it may not accurately determine the absolute tissue distribution 
of low abimdance genes or may underestimate the degree of overexpression of more 
abundant genes due to signal saturation. For those genes showing overexpression by 
microarray expression profiling, further analysis was performed using quantitative RT-PCR 
10 based on Taqman™ probe detection, which comprises a greater dynamic range of 
sensitivity. Several different panels of normal and tumor tissues, distant metastases and cell 
lines were used for this purpose. 

Quantitative Real-time Polymerase Chain Reaction 

Suitable polynucleotides according to the present invention may be further 
15 characterized or, ahematively, originally identified by employing a quantitative PGR 
methodology such as, for example, the Real-time PGR methodology. By this methodology, 
tissue and/or tumor samples, such as, e.g., metastatic tumor samples, may be tested along 
side the corresponding normal tissue sample and/or a panel of unrelated normal tissue 
samples. 

20 Real-time PGR (see Gibson et al., Genome Research 6:995-1001, 1996; 

Held et al.. Genome Research 5:986-994, 1996) is a technique that evaluates the level of 
PGR product accumulation during amplification. This technique permits quantitative 
evaluation of mRNA levels in multiple samples. Briefly, mRNA is extracted from tumor 
and normal tissue and cDNA is prepared using standard techniques. 

25 Real-time PGR may, for example, be performed either on the ABI 7700 

Prism or on a GeneAmp® 5700 sequence detection system (PE Biosystems, Foster Gity, 
GA). The 7700 system uses a forward and a reverse primer in combination with a specific 
probe with a 5' fluorescent reporter dye at one end and a 3' quencher dye at the other end 
(Taqman"^^^). When the Real-time PGR is performed using Taq DNA polymerase with 5'- 
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3' nuclease activity, the probe is cleaved and begins to fluoresce allowing the reaction to be 
monitored by the increase in fluorescence (Real-time). The 5700 system uses SYBR® 
green, a fluorescent dye, that only binds to double stranded DNA, and the same forward 
and reverse primers as the 7700 instrument Matching primers and fluorescent probes may 
5 be designed accordmg to the primer express program (PE Biosystems, Foster City, CA). 
Optimal concentrations of primers and probes are initially determined by those of ordinary 
skill in the art. Control (e,g., P-actin) primers and probes may be obtained commercially 
from, for example, Perkin Elmer/Applied Biosystems (Foster City, CA). 

To quantitate the amount of specific RNA in a sample, a standard curve is 

10 generated using a plasmid containing the gene of interest. Standard curves are generated 
using the Ct values determined in the real-time PCR, which are related to the initial cDNA 
concentration used in the assay. Standard dilutions ranging from 10-10^ copies of the gene 
of interest are generally sufficient. In addition, a standard curve is generated for the control 
sequence. This permits standardization of initial RNA content of a tissue sample to the 

1 5 amount of control for compmson purposes. 

In accordance with the above, and as described further below, the present 
invention provides the illustrative breast tissue- and/or tumor-specific polynucleotides 
mammaglobin, lipophilin B, GABAtt (B899P), B726P, B51 IS, B533S, B305D and B311D 
having sequences set forth in SEQ ID NO: 1, 3, 5-7, 11, 13, 15, 17, 19-24, 30, 32, and 73- 

20 76 illustrative polypeptides encoded thereby having amino acid sequences set forth in SEQ 
ID NO: 2, 4, 8-10, 12, 14, 16, 18, 25-29 and 31 and 77 that may be suitably employed in 
the detection of cancer, more specifically, breast cancer. 

The methods disclosed herein will also permit the identification of 
additional and/or alternative polynucleotides that are suitable for the detection of a wide 

25 range of cancers including, but not limited to, prostate cancer, breast cancer, colon cancer, 
ovarian cancer, lung cancer head & neck cancer, lymphoma, leukemia, melanoma, liver 
cancer, gastric cancer, kidney cancer, bladder cancer, pancreatic cancer and endometrial 
cancer. 
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Methodologies for the Detection of Cancer 

In general, a cancer cell may be detected in a patient based on the presence 
of one or more polynucleotides within cells of a biological sample (for example, blood, 
lymph nodes, bone marrow, sera, sputum, urine and/or tumor biopsies) obtained from the 
5 patient. In other words, such polynucleotides may be used as markers to indicate the 
presence or absence of a cancer such as, e.g., breast cancer. 

As discussed in further detail herein, the present invention achieves these 
and other related objectives by providing a methodology for the simultaneous detection of 
more than one polynucleotide, the presence of which is diagnostic of the presence of cancer 

10 cells in a biological sample. Each of the various cancer detection methodologies disclosed 
herein have in common a step of hybridizing one or more oligonucleotide primers and/or 
probes, the hybridization of which is demonstrative of the presence of a tumor- and/or 
tissue-specific polynucleotide. Depending on the precise application contemplated, it may 
be preferred to employ one or more intron-spaiming oligonucleotides that are inoperative 

15 against polynucleotide of genomic DNA and, thus, these oligonucleotides are effective in 
substantially reducing and/or eliminating the detection of genomic DNA in the biological 
sample. 

Further disclosed herein are methods for enhancing the sensitivity of these 
detection methodologies by subjecting the biological samples to be tested to one or more 

20 cell capture and/or cell depletion methodologies. 

By certain embodiments of the present invention, the presence of a cancer 
cell m a patient may be determined by employing the following steps: (a) obtaining a 
biological sample from said patient; (b) contacting said biological sample with a first 
oligonucleotide that hybridizes to a first polynucleotide said first polynucleotide selected 

25 from the group consisting of polynucleotides depicted in SEQ ID NO:73 and SEQ ID 
NO:74; (c) contacting said biological sample with a second oligonucleotide that hybridizes 
to a second polynucleotide selected from the group consisting of SEQ ID NO: 1, 3, 5-7, 11, 
13, 15, 17, 19-24, 30, 32, and 75; (d) detecting in said sample an amount of a 
polynucleotide that hybridizes to at least one of the oligonucleotides; and (e) comparing the 
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amount of the polynucleotide that hybridizes to said oligonucleotide to a predetermined 
cut-off value, and therefrom determining the presence or absence of a cancer in the patient. 

Alternative embodiments of the present invention provide methods wherein 
the presence of a cancer cell in a patient is determined by employing the steps of: (a) 
5 obtaining a biological sample from said patient; (b) contacting said biological sample v^th 
a first oligonucleotide that hybridizes to a first polynucleotide said first polynucleotide 
depicted in SEQ ID NO:76; (c) contacting said biological sample with a second 
oligonucleotide that hybridizes to a second polynucleotide selected from the group 
consisting of SEQ ID NO: 1, 3, 5-7, 11, 13, 15, 17, 19-24, 30, 32, and 75; (d) detecting in 

10 said sample an amount of a polynucleotide that hybridizes to at least one of the 
oligonucleotides; and (e) comparing the amount of the polynucleotide that hybridizes to 
said oligonucleotide to a predetermined cut-off value, and therefrom determining the 
presence or absence of a cancer in the patient. 

Other embodiments of the present invention provide methods for 

15 determining the presence or absence of a cancer in a patient. Such methods comprise the 
steps of: (a) obtaining a biological sample from said patient; (b) contacting said biological 
sample obtained from a patient with a first oligonucleotide that hybridizes to a 
polynucleotide sequence selected from the group consisting of polynucleotides depicted in 
SEQ ID NO:73, SEQ ID NO:74 and SEQ ID NO:76; (c) contacting said biological sample 

20 with a second oligonucleotide that hybridizes to a polynucleotide as depicted in SEQ ID 
NO:75; (d) contacting said biological sample with a third oligonucleotide that hybridizes to 
a polynucleotide selected from the group consisting of polynucleotides depicted in SEQ ID 
N0:5, SEQ ID N0:6 and SEQ ID N0:7;(e) contacting said biological sample with a 
fourth oligonucleotide that hybridizes to a polynucleotide selected from the group 

25 consisting of polynucleotides depicted in SEQ ID N0:13, SEQ ID N0:15, SEQ ID N0:17, 
SEQ ID NO: 19, SEQ ID NO:20, SEQ ID N0:21, SEQ ID NO:22, SEQ ID NO:23 and SEQ 
ID NO:24; (f) detecting in said biological sample an amount of a polynucleotide that 
hybridizes to at least one of said oligonucleotides; and (g) comparing the amount of 
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polynucleotide that hybridizes to the oligonucleotide to a predetermined cut-off value, and 
therefrom determining the presence or absence of a cancer in the patient. 

To permit hybridization under assay conditions, oligonucleotide primers and 
probes should comprise an oligonucleotide sequence that has at least about 60%, preferably 
5 at least about 75% and more preferably at least about 90%, identity to a portion of a 
polynucleotide encoding a breast tumor protein that is at least 10 nucleotides, and 
preferably at least 20 nucleotides, in length. Preferably, oligonucleotide primers hybridize 
to a polynucleotide encoding a polypeptide described herein under moderately stringent 
conditions, as defined above. Oligonucleotide primers which may be usefully employed in 

10 the diagnostic methods described herein preferably are at least 10-40 nucleotides in length. 
In a preferred embodiment, the oligonucleotide primers comprise at least 10 contiguous 
nucleotides, more preferably at least 15 contiguous nucleotides, of a DNA molecule having 
a sequence recited in SEQ ID NO: 1, 3, 5-7, 11, 13, 15, 17, 19-24, 30, 32 and 73-76. 
Techniques for both PGR based assays and hybridization assays are well known in the art 

15 {see, for example, Mullis et aL, Cold Spring Harbor Symp. Quant BioL, 57:263, 1987; 
Eriich ed., PCR Technology, Stockton Press, NY, 1989), 

The present invention also provides amplification-based methods for 
detecting the presence of a cancer cell in a patient. Exemplary methods comprise the steps 
of (a) obtaining a biological sample from a patient; (b) contacting the biological sample 

20 with a first oligonucleotide pair the first pair comprising a jQrst oligonucleotide and a 
second oligonucleotide wherein the first oligonucleotide and the second oligonucleotide 
hybridize to a first polynucleotide and the complement thereof, respectively; (c) contacting 
the biological sample with a second oligonucleotide pair the second pair comprising a third 
oligonucleotide and a fourth oligonucleotide wherein the third and the fourth 

25 oligonucleotide hybridize to a second polynucleotide and the complement thereof, 
respectively, and wherein the first polynucleotide is unrelated in nucleotide sequence to the 
second polynucleotide; (d) amplifying the first polynucleotide and the second 
polynucleotide; and (e) detecting the amplified first polynucleotide and the amplified 
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second polynucleotide; wherein the presence of the amplified first polynucleotide or the 
amplified second polynucleotide indicates the presence of a cancer cell in the patient. 

Methods according to the present invention are suitable for identifying 
polynucleotides obtained from a wide variety of biological sample such as, for example, 
5 blood, serum, lymph node, bone marrow, sputum, urine and tumor biopsy sample. In 
certain preferred embodiments, the biological sample is either blood, a lymph node or bone 
marrow. In other embodiments of the present invention, the lymph node may be a sentinel 
lymph node. 

It will be apparent that the present methods may be employed in the 
10 detection of a wide variety of cancers. Exemplary cancers include, but are not limited to, 
prostate cancer, breast cancer, colon cancer, ovarian cancer, lung cancer head & neck 
cancer, lymphoma, leukemia, melanoma, liver cancer, gastric cancer, kidney cancer, 
bladder cancer, pancreatic cancer and endometrial cancer. 

Certain exemplary embodiments of the present invention provide methods 
15 wherein the polynucleotides to be detected are selected from the group consisting of 
mammaglobin, lipophilm B, GABAti (B899P), B726P, B511S, B533S, B305D and 
B311D, Alternatively and/or additionally, polynucleotides to be detected may be selected 
from the group consisting of those depicted in SEQ ID NO:73, SEQ ID NO:74, SEQ ID 
NO:75, SEQ ID N0:1, SEQ ID N0:3, SEQ ID N0:5, SEQ ID N0:6, SEQ ID N0:7, SEQ 
20 ID N0:11, SEQ ID N0:13, SEQ ID N0:15, SEQ ID N0:17, SEQ ID N0:19, SEQ ID 
NO:20, SEQ ID N0:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:30, 
SEQ ID NO:32, and SEQ ID NO:76. 

Suitable exemplary oligonucleotide probes and/or primers that may be used 
according to the methods of the present invention are disclosed herein by SEQ ID N0s:33- 
25 35 and 63-72. In certain preferred embodiments that eliminate the background detection 
of genomic DNA, the oligonucleotides may be intron spanning oligonucleotides. 
Exemplary intron spanning oligonucleotides suitable for the detection of various 
polynucleotides disclosed herein are depicted in SEQ ID NO:36-62. 
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Depending on the precise application contemplated, the artisan may prefer to detect 
the tissue- and/or tumor-specific polynucleotides by detecting a radiolabel and detecting a 
fluorophore. More specifically, the oligonucleotide probe and/or primer may comprises a 
detectable moiety such as, for example, a radiolabel and/or a fluorophore. 
5 Alternatively or additionally, methods of the present invention may also 

comprise a step of fractionation prior to detection of the tissue- and/or tumor-specific 
polynucleotides such as, for example, by gel electrophoresis. 

In other embodiments, methods described herein may be used as to monitor 
the progression of cancer. By these embodiments, assays as provided for the diagnosis of a 

10 cancer may be performed over time, and the change in the level of reactive polypeptide(s) 
or polynucleotide(s) evaluated. For example, the assays may be performed every 24-72 
hours for a period of 6 months to 1 year, and thereafter performed as needed. In general, a 
cancer is progressing in those patients in whom the level of polypeptide or polynucleotide 
detected increases over time. In contrast, the cancer is not progressing when the level of 

1 5 reactive polypeptide or polynucleotide either remains constant or decreases with time. 

Certain in vivo diagnostic assays may be performed directly on a tumor. 
One such assay involves contacting tumor cells with a binding agent. The bound binding 
agent may then be detected directly or indirectly via a reporter group. Such binding agents 
may also be used in histological applications. Altematively, polynucleotide probes may be 

20 used within such applications. 

As noted above, to improve sensitivity, multiple breast tumor protein 
markers may be assayed within a given sample. It will be apparent that binding agents 
specific for different proteins provided herein may be combined within a single assay. 
Further, multiple primers or probes may be used concurrently. The selection of tumor 

25 protein markers may be based on routine experiments to determine combinations that 
results in optimal sensitivity. In addition, or altematively, assays for tumor proteins 
provided herein may be combined with assays for other known tumor antigens. 
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Cell Enrichment 

In other aspects of the present invention, cell capture technologies may be 
used prior to polynucleotide detection to improve the sensitivity of the various detection 
5 methodologies disclosed herein. 

Exemplary cell enrichment methodologies employ immunomagnetic beads 
that are coated with specific monoclonal antibodies to surface cell markers^ or tetrameric 
antibody complexes, may be used to first enrich or positively select cancer cells in a 
sample. Various commercially available kits may be used, including Dynabeads® 
10 Epithelial Enrich (Dynal Biotech, Oslo, Norway), StemSep'^'^ (StemCell Technologies, 
Inc., Vancouver, BC), and RosetteSep (StemCell Technologies). The skilled artisan will 
recognize that other readily available methodologies and kits may also be suitably 
employed to enrich or positively select desired cell populations. 

Dynabeads® Epithelial Enrich contains magnetic beads coated with mAbs 
15 specific for two glycoprotein membrane antigens expressed on normal and neoplastic 
epithelial tissues. The coated beads may be added to a sample and the sample then applied 
to a magnet, thereby capturing the cells bound to the beads. The unwanted cells are 
washed away and the magnetically isolated cells eluted from the beads and used in further 
analyses. 

20 RosetteSep can be used to enrich cells directly from a blood sample and 

consists of a cocktail of tetrameric antibodies that target a variety of unwanted cells and 
crosslinks them to glycophorin A on red blood cells (RBC) present in the sample, forming 
rosettes. When centrifuged over FicoU, targeted cells pellet along with the free RBC. 

The combination of antibodies in the depletion cocktail determines which 

25 cells will be removed and consequently which cells will be recovered. Antibodies that are 
available include, but are not limited to: CD2, CDS, CD4, CDS, CD8, CDIO, CDllb, 
CD14, CD15, CD16, CD19, CD20, CD24, CD25, CD29, CD33, CD34, CD36, CD38, 
CD41, CD45, CD45RA, CD45RO, CD56, CD66B, CD66e, HLA-DR, IgE, and TCRap. 
Additionally, it is contemplated in the present invention that mAbs specific for breast 
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tumor antigens, can be developed and used in a similar manner. For example, mAbs that 
bind to tumor-specific cell surface antigens may be conjugated to magnetic beads, or 
formulated in a tetrameric antibody complex, and used to enrich or positively select 
metastatic breast tumor cells from a sample. 
5 Once a sample is enriched or positively selected, cells may be further 

analysed. For example, the cells may be lysed and RNA isolated. RNA may then be 
subjected to RT-PCR analysis using breast tumor-specific multiplex primers in a Real-time 
PGR assay as described herein. 

In another aspect of the present invention, cell capture technologies may be 
10 used in conjunction with Real-time PGR to provide a more sensitive tool for detection of 
metastatic cells expressing breast tumor antigens. Detection of breast cancer cells in bone 
marrow samples, peripheral blood, and small needle aspiration samples is desirable for 
diagnosis and prognosis in breast cancer patients. 

Probes and Primers 

15 As noted above and as described in further detail herein, certain methods, 

compositions and kits according to the present invention utilize two or more 
oligonucleotide primer pairs for the detection of cancer. The ability of such nucleic acid 
probes to specifically hybridize to a sequence of interest will enable them to be of use in 
detecting the presence of complementary sequences in a biological sample. 

20 Altematively, in other embodiments, the probes and/or primers of the 

present invention may be employed for detection via nucleic acid hybridization. As such, it 
is contemplated that nucleic acid segments that comprise a sequence region of at least 
about 15 nucleotide long contiguous sequence that has the same sequence as, or is 
complementary to, a 15 nucleotide long contiguous sequence of a polynucleotide to be 

25 detected will find particular utility. Longer contiguous identical or complementary 
sequences, e,g,, those of about 20, 30, 40, 50, 100, 200, 500, 1000 (including all 
intermediate lengths) and even up to full length sequences will also be of use in certain 
embodiments. 
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Oligonucleotide primers having sequence regions consisting of contiguous 
nucleotide stretches of 10-14, 15-20, 30, 50, or even of 100-200 nucleotides or so 
(including intermediate lengths as v^ell), identical or complementary to a polynucleotide to 
be detected , are particularly contemplated as primers for use in amplification reactions 
5 such as, e.g., the polymerase chain reaction (PGR™).. This would allow a polynucleotide 
to be analyzed, both in diverse biological samples such as, for example, blood, lymph 
nodes and bone marrow. . 

The use of a primer of about 15-25 nucleotides in length allows the 
formation of a duplex molecule that is both stable and selective. Molecules having 

10 contiguous complementary sequences over stretches greater than 15 bases in length are 
generally preferred, though, in order to increase stability and selectivity of the hybrid, and 
thereby improve the quality and degree of specific hybrid molecules obtained. One will 
generally prefer to design primers having gene-complementary stretches of 15 to 25 
contiguous nucleotides, or even longer where desired. 

15 Primers may be selected from any portion of the polynucleotide to be 

detected. All that is required is to review the sequence, such as those exemplary 
polynucleotides set forth in SEQ ID NO: 1, 3, 5-7, 11, 13, 15, 17, 19-24, 30, 32, 73-75 
(Figures 3-6, respectively) and SEQ ID NO: 76 (lipophilin B) or to any continuous portion 
of the sequence, from about 15-25 nucleotides in length up to and including the full length 

20 sequence, that one wishes to utilize as a primer. The choice of primer sequences may be 
govemed by various factors. For example, one may wish to employ primers from towards 
the termini of the total sequence. The exemplary primers disclosed herein may optionally 
be used for their ability to selectively form duplex molecules with complementary stretches 
of the entire polynucleotide of interest such as those set forth in SEQ ID NO: 1, 3, 5-7, 11, 

25 13, 15, 17, 19-24, 30, 32, 73-75 (Figures 3-6, respectively), and SEQ ID NO:76 (lipophilin 
B). 

The present invention ftirther provides the nucleotide sequence of various 
exemplary oligonucleotide primers and probes, set forth in SEQ ID NO: 33-71, that may be 
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used, as described in further detail herein, according to the methods of the present 
invention for the detection of cancer. 

Oligonucleotide primers according to the present invention may be readily 
prepared routinely by methods commonly available to the skilled artisan including, for 
5 example, directly synthesizing the primers by chemical means, as is commonly practiced 
using an automated oligonucleotide synthesizer. Depending on the application envisioned, 
one will typically desire to employ varying conditions of hybridization to achieve varying 
degrees of selectivity of probe towards target sequence. For applications requiring high 
selectivity, one will typically desire to employ relatively stringent conditions to form the 
10 hybrids, e.g., one will select relatively low salt and/or high temperature conditions, such as 
provided by a sah concentration of from about 0.02 M to about 0.15 M salt at temperatures 
of from about 50®C to about 70^C. Such selective conditions tolerate little, if any, 
mismatch between the probe and the template or target strand, and would be particularly 
suitable for isolating related sequences. 

15 

Polvnucleotide Amplification Techniques 

Each of the specific embodiments outlined herein for the detection of cancer 
has in common the detection of a tissue- and/or tumor-specific polynucleotide via the 
hybridization of one or more oligonucleotide primers and/or probes. Depending on such 

20 factors as the relative number of cancer cells present in the biological sample and/or the 
level of polynucleotide expression within each cancer cell, it may be preferred to perform 
an amplification step prior to performing the steps of detection. For example, at least two 
oligonucleotide primers may be employed in a polymerase chain reaction (PGR) based 
assay to amplify a portion of a breast tumor cDNA derived from a biological sample, 

25 wherein at least one of the oligonucleotide primers is specific for (Le,, hybridizes to) a 
polynucleotide encoding the breast tumor protein. The amplified cDNA may optionally be 
subjected to a fractionation step such as, for example, gel electrophoresis. 

A number of template dependent processes are available to amplify the 
target sequences of interest present in a sample. One of the best known amplification 
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methods is the polymerase chain reaction (PGR™) which is described in detail in U.S. 
Patent Nos. 4,683,195, 4,683,202 and 4,800,159, each of which is incorporated herein by 
reference in its entirety. Briefly, in PGR™, two primer sequences are prepared which are 
complementary to regions on opposite complementary strands of the target sequence. An 
5 excess of deoxynucleoside triphosphates is added to a reaction mixture along with a DNA 
polymerase (e.g., Taq polymerase). If the target sequence is present in a sample, the 
primers will bind to the target and the polymerase will cause the primers to be extended 
along the target sequence by adding on nucleotides. By raising and lowering the 
temperature of the reaction mixture, the extended primers will dissociate from the target to 

10 form reaction products, excess primers will bind to the target and to the reaction product 
and the process is repeated. Preferably reverse transcription and PGR™ amplification 
procedure may be performed in order to quantify the amount of mRNA amplified. 
Polymerase chain reaction methodologies are well known in the art. 

One preferred methodology for polynucleotide amplification employs RT- 

15 PGR, in which PGR is applied in conjunction with reverse transcription. Typically, RNA is 
extracted fi*om a biological sample, such as blood, serum, lymph node, bone marrow, 
sputum, urine and tumor biopsy samples, and is reverse transcribed to produce cDNA 
molecules. PGR amplification using at least one specific primer generates a cDNA 
molecule, which may be separated and visualized using, for example, gel electrophoresis. 

20 Amplification may be performed on biological samples taken firom a patient and firom an 
individual who is not afflicted with a cancer. The amplification reaction may be performed 
on several dilutions of cDNA spanning two orders of magnitude. A two-fold or greater 
increase in expression in several dilutions of the test patient sample as compared to the 
same dilutions of the non-cancerous sample is typically considered positive. 

25 Any of a variety of commercially available kits may be used to perform the 

amplification step. One such ampHfication technique is inverse PGR {see Triglia et al, 
Nucl Acids Res. 75:8186, 1988), which uses restriction enzymes to generate a fragment in 
the known region of the gene. The fragment is then circularized by intramolecular ligation 
and used as a template for PGR with divergent primers derived from the known region. 
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Within an alternative approach, sequences adjacent to a partial sequence may be retrieved 
by amplification with a primer to a linker sequence and a primer specific to a known 
region. The amplified sequences are typically subjected to a second round of amplification 
with the same linker primer and a second primer specific to the known region. A variation 
5 on this procedure, which employs two primers that initiate extension in opposite directions 
fi:om the known sequence, is described in WO 96/38591. Another such technique is known 
as "rapid amplification of cDNA ends" or RACE. This technique involves the use of an 
internal primer and an extemal primer, which hybridizes to a polyA region or vector 
sequence, to identify sequences that are 5' and 3' of a known sequence. Additional 

10 techniques include capture PGR (Lagerstrom et al, PCR Methods Applic. 7:111-19, 1991) 
and walking PCR (Parker et al, Nucl Acids. Res. 7P:3055-60, 1991). Other methods 
employing amplification may also be employed to obtain a fiiU length cDNA sequence. 

Another method for amplification is the ligase chain reaction (referred to as 
LCR), disclosed in Eur. Pat. Appl. Publ. No. 320,308 (specifically incorporated herein by 

15 reference in its entirety). In LCR, two complementary probe pairs are prepared, and in the 
presence of the target sequence, each pair will bind to opposite complementary strands of 
the target such that they abut. In the presence of a ligase, the two probe pairs will link to 
form a single unit. By temperature cycling, as in PCR™, bound ligated units dissociate 
from the target and then serve as "target sequences" for ligation of excess probe pairs. U.S. 

20 Patent No. 4,883,750, incorporated herein by reference in its entirety, describes an 
altemative method of amplification similar to LCR for binding probe pairs to a target 
sequence. 

Qbeta Replicase, described in PCT Intl. Pat. Appl. Publ. No. 
PCTAJS87/00880, incorporated herein by reference in its entirety, may also be used as still 
25 another amplification method in the present invention. In this method, a replicative 
sequence of RNA that has a region complementary to that of a target is added to a sample 
in the presence of an RNA polymerase. The polymerase will copy the replicative sequence 
that can then be detected. 
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An isothermal amplification method, in which restriction endonucleases and 
ligases are used to achieve the amplification of target molecules that contain nucleotide 
5'-[a-thio]triphosphates in one strand of a restriction site (Walker et al, 1992, incorporated 
herein by reference in its entirety), may also be usefiil in the amplification of nucleic acids 
5 in the present invention. 

Strand Displacement Amplification (SDA) is another method of carrying 
out isothermal amplification of nucleic acids which involves multiple rounds of strand 
displacement and synthesis, le. nick translation. A similar method, called Repair Chain 
Reaction (RCR) is another method of amplification which may be useful in the present 

10 invention and is involves annealing several probes throughout a region targeted for 
amplification, followed by a repair reaction in which only two of the four bases are present. 
The other two bases can be added as biotinylated derivatives for easy detection. A similar 
approach is used in SDA. 

Sequences can also be detected using a cyclic probe reaction (CPR). In 

15 CPR, a probe having a 3' and 5' sequences of non-target DNA and an internal or "middle" 
sequence of the target protein specific RNA is hybridized to DNA which is present in a 
sample. Upon hybridization, the reaction is treated with RNaseH, and the products of the 
probe are identified as distinctive products by generating a signal that is released after 
digestion. The original template is annealed to another cycling probe and the reaction is 

20 repeated. Thus, CPR involves amplifying a signal generated by hybridization of a probe to 
a target gene specific expressed nucleic acid. 

Still other amplification methods described in Great Britain Pat. Appl. No. 2 
202 328, and in PCT Intl. Pat. Appl. Publ. No. PCT/US89/01025, each of which is 
incorporated herein by reference in its entirety, may be used in accordance with the present 

25 invention. In the former application, "modified" primers are used in a PCR-like, template 
and enzyme dependent synthesis. The primers may be modified by labeling with a capture 
moiety (e.g., biotin) and/or a detector moiety (e.g., enzyme). In the latter application, an 
excess of labeled probes is added to a sample. In the presence of the target sequence, the 
probe binds and is cleaved catalytically. After cleavage, the target sequence is released 
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intact to be bound by excess probe. Cleavage of the labeled probe signals the presence of 
the target sequence. 

Other nucleic acid amplification procedures include transcription-based 
amplification systems (TAS) (Kwoh et al, 1989; PCT Intl. Pat. Appl. Publ. No. WO 
5 88/10315, incorporated herem by reference in its entirety), including nucleic acid sequence 
based amplification (NASBA) and 3SR. In NASBA, the nucleic acids can be prepared for 
amplification by standard phenol/chloroform extraction, heat denaturation of a sample, 
treatment with lysis buffer and minispin columns for isolation of DNA and RNA or 
guanidinium chloride extraction of RNA. These amplification techniques involve 

10 annealing a primer that has sequences specific to the target sequence. Following 
polymerization, DNA/RNA hybrids are digested vrith RNase H while double stranded 
DNA molecules are heat-denatured again. In either case the single stranded DNA is made 
fiiUy double stranded by addition of second target-specific primer, followed by 
polymerization. The double stranded DNA molecules are then multiply transcribed by a 

15 polymerase such as T7 or SP6. In an isothermal cyclic reaction, the RNAs are reverse 
transcribed into DNA, and transcribed once again with a polymerase such as T7 or SP6. 
The resulting products, whether truncated or complete, indicate target-specific sequences. 

Eur. Pat. Appl. Publ No. 329,822, incorporated herein by reference in its 
entirety, disclose a nucleic acid amplification process involving cyclically synthesizing 

20 single-stranded RNA ("ssRNA"), ssDNA, and double-stranded DNA (dsDNA), which may 
be used in accordance with the present invention. The ssRNA is a first template for a first 
primer oligonucleotide, which is elongated by reverse transcriptase (RNA-dependent DNA 
polymerase). The RNA is then removed fi:om resulting DNA:RNA duplex by the action of 
ribonuclease H (RNase H, an RNase specific for RNA in a duplex with either DNA or 

25 RNA). The resuhant ssDNA is a second template for a second primer, which also includes 
the sequences of an RNA polymerase promoter (exemplified by T7 RNA polymerase) 5' to 
its homology to its template. This primer is then extended by DNA polymerase 
(exemplified by the large "Klenow" fragment of E. coli DNA polymerase I), resulting as a 
double-stranded DNA ("dsDNA") molecule, having a sequence identical to that of the 
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original RNA between the primers and having additionally, at one end, a promoter 
sequence. This promoter sequence can be used by the appropriate RNA polymerase to 
make many RNA copies of the DNA. These copies can then re-enter the cycle leading to 
very swift amplification. With proper choice of enzymes, this amplification can be done 

5 isothermally without addition of enzymes at each cycle. Because of the cyclical nature of 
this process, the starting sequence can be chosen to be in the form of either DNA or RNA. 

PCT Intl. Pat. Appl. Publ. No. WO 89/06700, incorporated herein by 
reference in its entirety, disclose a nucleic acid sequence amplification scheme based on the 
hybridization of a promoter/primer sequence to a target single-stranded DNA ("ssDNA") 

10 followed by transcription of many RNA copies of the sequence. This scheme is not cyclic; 
i.e. new templates are not produced from the resultant RNA transcripts. Other 
amplification methods include "RACE" (Frohman, 1990), and "one-sided PGR" (Ohara, 
1 989) which are well-known to those of skill in the art. 



Compositions and Kits for the Detection of Cancer 

15 The present invention further provides kits for use within any of the above 

diagnostic methods. Such kits typically comprise two or more components necessary for 
performing a diagnostic assay. Components may be compounds, reagents, containers 
and/or equipment. For example, one container within a kit may contain a monoclonal 
antibody or fragment thereof that specifically binds to a breast tumor protein. Such 

20 antibodies or fragments may be provided attached to a support material, as described 
above. One or more additional containers may enclose elements, such as reagents or 
buffers, to be used in the assay. Such kits may also, or altematively, contain a detection 
reagent as described above that contains a reporter group suitable for direct or indirect 
detection of antibody binding, 

25 The present invention also provides kits that are suitable for performing the 

detection methods of the present invention. Exemplary kits comprise oligonucleotide 
primer pairs each one of which specifically hybridizes to a distinct polynucleotide. Within 
certain embodiments, kits according to the present invention may also comprise a nucleic 
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acid polymerase and suitable buffer. Exemplary oligonucleotide primers suitable for kits 
of the present invention are disclosed herein by SEQ ID NO: 33-7L Exemplary 
polynucleotides suitable for kits of the present invention are disclosed in SEQ ID NOiTS, 
SEQ ID NO:74, SEQ ID NO:75, SEQ ID N0:1, SEQ ID NO:3, SEQ ID N0:5, SEQ ID 

5 N0:6, SEQ ID N0:7, SEQ ID N0:11, SEQ ID N0:13, SEQ ID N0:15, SEQ ID N0:17, 
SEQ ID N0:19, SEQ ID NO:20, SEQ ID N0:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID 
NO:24, SEQ ID NO:30, SEQ ID NO:32, and lipophilin B. 

Alternatively, a kit may be designed to detect the level of mRNA encoding a 
breast tumor protein in a biological sample. Such kits generally comprise at least one 

10 oligonucleotide probe or primer, as described above, that hybridizes to a polynucleotide 
encoding a breast tumor protein. Such an oligonucleotide may be used, for example, 
within a PGR or hybridization assay. Additional components that may be present within 
such kits include a second oligonucleotide and/or a diagnostic reagent or container to 
facilitate the detection of a polynucleotide encoding a breast tumor protein. 

15 In other related aspects, the present invention further provides compositions 

useful in the methods disclosed herein. Exemplary compositions comprise two or more 
oligonucleotide primer pairs each one of which specifically hybridizes to a distinct 
polynucleotide. Exemplary oligonucleotide primers suitable for compositions of the 
present invention are disclosed herein by SEQ ID NO: 33-71. Exemplary polynucleotides 

20 suitable for compositions of the present invention are disclosed in SEQ ID NO:73, SEQ ID 
NO:74, SEQ ID NO:75, SEQ ID N0:1, SEQ ID N0:3, SEQ ID N0:5, SEQ ID N0:6, SEQ 
ID N0:7, SEQ ID N0:11, SEQ ID N0:13, SEQ ID N0:15, SEQ ID N0:17, SEQ ID 
N0:19, SEQ ID NO:20, SEQ ID N0:21, SEQ ID N0:22, SEQ ID NO:23, SEQ ID NO:24, 
SEQ ID NO:30, SEQ ID NO:32, and lipophilin B. 

25 The following Examples are offered by way of illustration and not by way 

of limitation. 
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EXAMPLES 

EXAMPLE 1 
Differential Display 

5 This example discloses the use of differential display to enrich for 

polynucleotides that are over-expressed in breast tumor tissues. 

Differential display was performed as described in the literature {see, e.g,, 
Liang, P. et aL, Science 257:967-971 (1993), incorporated herein by reference in its 
entirety) with the following modifications: (a) PGR amplification products were visualized 
10 on silver stained gels (b) genetically matched pairs of tissues were used to eliminate 
polymorphic variation (c) two different dilutions of cDNA were used as template to 
eliminate any dilutional effects (see, Mou, E. et al., Biochem Biophy Res Commun, 
199 :564-569 (1994) , incorporated herein by reference in its entirety). 

15 EXAMPLE 2 

Preparation of cDNA Subtraction Library 
This example discloses the preparation of a breast tumor cDNA subtraction 
library enriched in breast tumor specific polynucleotides. 

cDNA library subtraction was performed as described with some 

20 modification. See, Hara, T. et al., Blood 84: 189-199 (1994), incorporated herein by 
reference in its entirety. The breast tumor library (tracer) that was made from a pool of 
three breast tumors was subtracted with normal breast library (driver) to identify breast 
tumor specific genes. More recent subtractions utilized 6-10 normal tissues as driver to 
subtract out common genes more efficiently, with an emphasis on essential tissues along 

25 with one "immunological" tissue {e.g., spleen, lymph node, or PBMC), to assist in the 
removal of cDNAs derived from lymphocyte infiltration in tumors. The breast tumor 
specific subtracted cDNA library was generated as follows: driver cDNA library was 
digested with EcoRI, NotI, and Sful (Sful cleaves the vector), filled in with DNA 
polymerase klenow fragment. After phenol-chloroform extraction and ethanol 
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precipitation, the DNA was labeled with Photoprobe biotin and dissolved in H2O. Tracer 
cDNA library was digested with BamHI and Xhol, phenol chloroform extracted, passed 
through Chroma spin-400 columns, ethanol precipitated, and mixed with driver DNA for 
hybridization at Si^'C for 20 hours [long hybridization (LH)]. The reaction mixture was 

5 then subjected to the streptavidin treatment followed by phenol/chloroform extraction for a 
total of four times. Subtracted DNA was precipitated and subjected to a hybridization at 
68°C for 2 hours with driver DNA again [short hybridization (SH)]. After removal of 
biotinylated double-stranded DNA, subtracted cDNA was ligated into BamHI/XhoI site of 
Chloramphenicol resistant pBCSK"^ and transformed into ElectroMax E, coli DHIOB cells 

10 by electroporation to generate subtracted cDNA library. To clone less abundant breast 
tumor specific genes, cDNA library subtraction was repeated by subtracting the tracer 
cDNA library with the driver cDNA library plus abundant cDNAs from primary 
subtractions. This resulted in the depletion of these abundant sequences and the generation 
of subtraction libraries that contain less abundant sequences. 

1 5 To analyze the subtracted cDNA library, plasmid DNA was prepared from 

100-200 independent clones, which were randomly picked from the subtracted library, and 
characterized by DNA sequencing. The determined cDNA and expected amino acid 
sequences for the isolated cDNAs were compared to known sequences using the most 
recent Genbank and human EST databases. 

20 

EXAMPLE 3 

PCR-SUBTRACTION 

This example discloses PGR subtraction to enrich for breast tumor specific 
polynucleotides. 

25 PGR-subtraction was performed essentially as described in the literature. 

See, Diatchenko, L. et al., Proc Natl Acad Sci USA. 93:6025-6030 (1996) and Yang, G.P. 
et aL, Nucleic acids Res. 27:1517-23 (1999), incorporated herein by reference in their 
entirety. Briefly, this type of subtraction works by ligating two different adapters to 
different aliquots of a restriction enzyme digested tester (breast tumor) cDNA sample, 
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followed by mixing of the testers separately with excess driver (without adapters). This 
first hybridization results in normalization of single stranded tester specific cDNA due to 
the second order kinetics of hybridization. These separate hybridization reactions are then 
mixed without denaturation, and a second hybridization performed which produces the 

5 target molecules; double stranded cDNA fragments containing both of the different 
adapters. Two rounds of PGR were performed, which results in the exponential 
amplification of the target population molecules (normalized tester specific cDNAs), while 
other fragments were either unamplified or only amplified in a linear manner. The 
subtractions performed included a pool of breast tumors subtracted with a pool of normal 

10 breast and a pool of breast tumors subtracted with a pool of normal tissues including 
PBMC, brain, pancreas, liver, small intestine, stomach, heart and kidney. 

Prior to cDNA synthesis RNA was treated with DNase I (Ambion) 
in the presence of RNasin (Promega Biotech, Madison, WI) to remove DNA 
contamination. The cDNA for use in real-time PGR tissue panels was prepared using 25|al 

15 Oligo dT (Boehringer-Mannheim) primer with superscript II reverse transcriptase (Gibco 
BRL, Bethesda, MD). 

EXAMPLE 4 

Detection Of Breast Cancer Using Breast-Specific Antigens 
20 The isolation and characterization of the breast-specific antigens B5 1 1 S and 

B533S is described in U.S. Patent Application 09/346,327, filed July 2, 1999, the 
disclosure of which is hereby incorporated by reference in its entirety. The determined 
cDNA sequence for B51 IS is provided in SEQ ID NO: 30, with the corresponding amino 
acid sequence being provided in SEQ ID NO: 31. The determined cDNA sequence for 
25 B533S is provided in SEQ ID NO: 32. The isolation and characterization of the breast- 
specific antigen B726P is described in U.S. Patent Applications 09/285,480, filed April 2, 
1999, and 09/433,826, filed November 3, 1999, the disclosures of which are hereby 
incorporated by reference in their entirety. 
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The determined cDNA sequences for splice variants of B726P are provided 
in SEQ ID NO: 13, 15, 17 and 19-24, with the corresponding amino acid sequences being 
provided in SEQ ID NO: 14, 16, 18 and 25-29. 

The isolation and characterization of the breast-specific antigen B305D 
forms A and C has been described in U.S. Patent Application 09/429,755, filed October 28, 
1999, the disclosure of which is hereby incorporated by reference in its entirety. 
Determined cDNA sequences for B305D isoforms A and C are provided in SEQ ID NO: 1, 

3 and 5-7, with the corresponding amino acid sequences being provided in SEQ ID NO: 2, 

4 and 8-10. 

The isolation and characterization of the breast-specific antigen B311D has 
been described in U.S. Patent Application 09/289,198, filed April 9, 1999, the disclosure of 
which is hereby incorporated by reference in its entirety. The determined cDNA sequence 
for B311D is provided in SEQ ID N0:11, v^th the corresponding amino acid sequence 
being provided in SEQ ID NO: 12. 

cDNA sequences for mammaglobin are provided in Figs. 4 and 5, with the 
cDNA sequence for GABAtc being provided in Fig 6 and are disclosed in SEQ ID NO: 73- 
75, respectively. 

The isolation and characteization of the breast-specific antigen lipophilin B 
has been described in U.S. Patent Application 09/780,842, filed February 8, 2001, the 
disclosure of which is hereby incorporated by reference in its entirety. The determined 
cDNA sequence for lipophilin B is provided in SEQ ID NO:76, with the corresponding 
amino acid sequence being provided in SEQ ID NO:77. The nucleotide sequences of 
several sequence variants of lipophilin B are also described in the 09/780,842 application. 

EXAMPLE 5 
MiCROARRAY ANALYSIS 

This example discloses the use of microarray analyses to identify 
polynucleotides that are at least two-fold overexpressed in breast tumor tissue s^ples as 
compared to normal breast tissue samples. 
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mRNA expression of the polynucleotides of interest was performed as 
follows. cDNA for the different genes was prepared as described above and arrayed on a 
glass slide (Incyte, Palo Alto, CA). The arrayed cDNA was then hybridized with a 1:1 
mixture of Cy3 or Cy5 fluorescent labeled first strand cDNAs obtained from polyA+ RNA 
5 from breast tumors, normal breast and normal tissues and other tumors as described in 
Shalon, D. et al.. Genome Res, 6:639-45 (1996), incorporated herein by reference in its 
entirety. Typically Cy3 (Probe 1) was attached to cDNAs from breast tumors and Cy5 
(Probe 2) to normal breast tissue or other normal tissues. Both probes were allowed to 
compete with the immobilized gene specific cDNAs on the chip, washed then scanned for 

10 fluorescence intensity of the individual Cy3 and Cy5 fluorescence to determine extent of 
hybridization. Data were analyzed using GEMTOOLS software (Incyte, Palo Alto, CA) 
which enabled the overexpression patterns of breast tumors to be compared with normal 
tissues by the ratios of Cy3/Cy5. The fluorescence intensity was also related to the 
expression level of the individual genes. 

15 DNA microarray analyses was used primarily as a screening tool to 

determine tissue/tumor specificity of cDNA's recovered from the differential display, 
cDNA library and PGR subtractions, prior to more rigorous analysis by quantitative RT- 
PCR, northern blotting, and immunohistochemistry. Microarray analysis was performed 
on two microchips. A total of 3603 subtracted cDNA's and 197 differential display 

20 templates were evaluated to identify 40 candidates for further analysis by quantitative PGR, 
From these candidates, several were chosen on the basis of favorable tissue specificity 
profiles, including B305D, B311D, B726P, B511S and B533S, indicating their 
overexpression profiles in breast tumors and/or normal breast versus other normal tissues. 
It was evident that the expression of these genes showed a high degree of specificity for 

25 breast tumors and/or breast tissue. In addition, these genes have in many cases 
complementary expression profiles. 

The two known breast-specific genes, mammaglobin and y-aminobutyrate 
type A receptor n subunit (GABAti) were also subjected to microarray analysis. mRNA 
expression of mammaglobin has been previously described to be upregulated in 
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proliferating breast tissue, including breast tumors. See, (Watson et al.. Cancer Res,, 56: 
860-5 (1996); Watson et al, Cancer Res., 59: 3028-3031 (1999); Watson et al. Oncogene, 
16:817-24 (1998), incorporated herein by reference in their entirety). The GABAtt mRNA 
levels were over-expressed in breast tumors. Previous studies had demonstrated its 
5 overexpression in uterus and to some degree in prostate and lung (Hedblom et al., J Biol. 
Chem. 272:15346-15350 (1997)) but no previous study had indicated elevated levels in 
breast tumors. 

EXAMPLE 6 

1 0 Quantitative Real-time PGR Analysis 

This example discloses the use of quantitative Real-time PGR to conjfirm the 
microarray identification polynucleotide that are at least two-fold overexpressed in breast 
tumor tissue samples as compared to normal breast tissue samples. 

The tumor- and/or tissue-specificity of the polynucleotides identified by the 

15 microarray analyses disclosed herein in Example 5, were confirmed by quantitative PGR 
analyses. Breast metastases, breast tumors, benign breast disorders and normal breast 
tissue along with other normal tissues and tumors were tested in quantitative (Real time) 
PGR. This was performed either on the ABI 7700 Prism or on a GeneAmp® 5700 
sequence detection system (PE Biosystems, Foster Gity, GA). The 7700 system uses a 

20 forward and a reverse primer in combination with a specific probe designed to anneal to 
sequence between the forward and reverse primer. This probe was conjugated at the 5 'end 
with a fluorescent reporter dye and a quencher dye at the other Y end (Taqman™). During 
PGR the Taq DNA polymerase with it's 5'-3' nuclease activity cleaved the probe which 
began to fluoresce, allowing the reaction to be monitored by the increase in fluorescence 

25 (Real-time). Holland et al., Proc Natl Acad Sci USA. 88:7276-7280 (1991). The 5700 
system used SYBR® green, a fluorescent dye, that only binds to double stranded DNA 
(Schneeberger et al., PGR Methods Appl. 4:234-8 (1995)), and the same forward and 
reverse primers as the 7700 instrument. No probe was needed. Matching primers and 
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fluorescent probes were designed for each of the genes according to the Primer Express 
program (PE Biosystems, Foster City, CA). 



Table 1. 



Primer and Probe Sequences for the Genes of Ini 


terest 




Forward Primer 


Reverse primer 


Probe 


Mammaglobin 


TGCCATAGATGA 
ATTGAAGGAATG 
(SEQIDNO:48) 


TGTCATATATTAATT 

GCATAAACACCTCA 
(SEQIDNO:49) 


TCTTAACCAAACGG 

ATGAAACTCTGAGC 

AATG(SEQID 

NO:50) 


B305D-C form 


AAAGCAGATGGT 

GGTTGAGGTT 

(SEQIDNO:39) 


CCTGAGACCAAATG 
GCTTCTTC (SEQ ID 
NO:40) 


ATTCCATGCCGGCT 
GCTTCTTCTG (SEQ 
IDN0:41) 


B311D 


CCGCTTCTGACAA 

CACTAGAGATC 

(SEQIDNO:63) 


CCTATAAAGATGTT 
ATGTACCAAAAATG 
AAGT(SEQIDNO:64) 


CCCCTCCCTCAGGG 
TATGGCCC (SEQ ID 
NO:65) 


B726P 


TCTGGTTTTCTCA 
TTCTTTATTCATT 
TATT (SEQ ID 
NO:42) 


TGCCAAGGAGCGGA 
TTATCT (SEQ ID 
NO:43) 


CAACCACGTGACA 
AACACTGGAATTAC 
AGG (SEQIDNO:44) 


B533S 


CCCTTTCTCACCC 
ACACACTGT (SEQ 
IDNO:66) 


TGCATTCTCTCATAT 
GTGGAAGCT (SEQ ID 
NO:67) 


CCGGGCCTCAGGC 
ATATACTATTCTAC 
TGTCTG (SEQ ID 
NO:68) 


GABAtc 


AAGCCTCAGAGT 

CCTTCCAGTATG 

(SEQIDNO:36) 


AAATATAAGTGAAG 
AAAAAAATTAGTAG 
AT (SEQ ID NO:72) 


AATCCATTGTATCT 
TAGAACCGAGGGA 
TTTGTTTAGA (SEQ 
IDNO:38) 


B511S 


GACATTCCAGTTT 

TACCCAAATGG 

(SEQIDNO:69) 


TGCAGAAGACTCAA 
GCTGATTCC (SEQ ID 
NO:70) 


TCTCAGGGACACAC 
TCTACCATTCGGGA 
(SEQIDN0:71) 



The concentrations used in the quantitative PGR for the forward primers for 
mammaglobin, GABAtt, B305D C form, B311D, B511S, B533S and B726P were 900, 
10 900, 300, 900, 900, 300 and 300nM respectively. For the reverse primers they were 300, 
900, 900, 900, 300, 900 and 900nM respectively. Primers and probes so produced were 
used in the universal thermal cycling program in real-time PGR. They were titrated to 
determine the optimal concentrations using a checkerboard approach. A pool of cDNA 
from target tumors was used in this optimization process. The reaction was performed in 
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25\i\ volumes. The final probe concentration in all cases was 160nM, dATP, dCTP and 
dGTP were at 0.2mM and dUTP at 0.4mM. Amplitaq gold and Amperase UNG (PE 
Biosystems, Foster City, CA) were used at 0.625 units and 0.25 units per reaction. MgCli 
was at a final concentration of 5mM. Trace amounts of glycerol, gelatin and Tween 20 
5 (Sigma Chem Co, St Louis, MO) were added to stabilize the reaction. Each reaction 
contained 2|lx1 of diluted template. The cDNA from RT reactions prepared as above was 
diluted 1:10 for the gene of interest and 1:100 for p-Actm. Primers and probes for p-Actin 
(PE Biosystems, Foster City, CA) were used in a similar manner to quantitate the presence 
of p-actin in the samples. In the case of the SYBR® green assay, the reaction mix (25\i\) 
10 included 2.5|li1 of SYBR green buffer, 2|il of cDNA template and 2,5|il each of the forward 
and reverse primers for the gene of interest. This mix also contained 3mM MgCb, 
0.25units of AmpErase UNG, 0.625 units of Amplitaq gold, 0.08% glycerol, 0.05% gelatin, 
0.0001% Tween 20 and ImM dNTP mix. In both formats, 40 cycles of amplification were 
performed. 

15 In order to quantitate the amount of specific cDNA (and hence initial 

mRNA) in the sample, a standard curve was generated for each run using the plasmid 
containing the gene of interest. Standard curves were generated using the Ct values 
determined in the real-time PCR which were related to the initial cDNA concentration used 
in the assay. Standard dilutions ranging firom 20-2x10^ copies of the gene of interest were 

20 used for this purpose. In addition, a standard curve was generated for the housekeeping 
gene p-actin ranging fi^om 200fg-2000pg to enable normalization to a constant amount of 
p-Actin. This allowed the evaluation of the over-expression levels seen with each of the 
genes. 

The genes B31 ID, B533S and B726P were evaluated in quantitative PCR as 
25 described above on two different panels consisting of: (a) breast tumor, breast normal and 
normal tissues; and (b) breast tumor metastases (primarily lymph nodes), using the primers 
and probes shown above in Table 1 . The data for panel (a) is shown in Figure 1 for all three 
genes. The three genes showed identical breast tissue expression profiles. However, the 
relative level of gene expression was very different in each case. B311D in general was 
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expressed at lower levels than B533S and both less than B726P, but all three were 
restricted to breast tissue. The quantitative PGR thus confirmed there was a differential 
expression between normal breast tissue and breast tumors for all three genes, and that 
approximately 50% of breast tumors over-expressed these genes. When tested on a panel of 
5 distant metastases derived from breast cancers all three genes reacted with 14/21 
metastases and presented similar profiles. All three genes also exhibited increasing levels 
of expression as a function of pathological stage of the tumor, as shown for B533S in 
Figure 2. 

Mammaglobin is a homologue of a rabbit uteroglobin and the rat steroid 
10 binding protein subunit C3 and is a low molecular weight protein that is highly 
glycosylated. In contrast to its homologs, mammaglobin has been reported to be breast 
specific and over-expression has been described in breast tumor biopsies (23%) and 
primary and metastatic breast tumors (-75%) with reports of the detection of 
mammaglobin mRNA expression in 91% of lymph nodes from metastatic breast cancer 
15 patients. However, more rigorous analysis of mammaglobin gene expression by 
microarray and quantitative PGR as described above (panels (a) and (b) and a panel of 
other tumors and normal tissues and additional breast tumors), showed expression at 
significant levels in skin and salivary gland with much lower levels in esophagus and 
trachea, as shown in Table 2 below. 

20 
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Table 2 

Nomalized Distribution of Manmi^lobin and B5 1 1 S mRNA in Various Tissues 



Tissue 


Mean Copies 
Mammaglobin 
/ng P-Actin + 
SD 


PGR 
Positive 


Mean Copies 
B511S 
/ngP-ActinlSD 


PGR 
Positive 


PGR Positive 
(Mammaglobin/ 
B511S) 


Breast 
Tumors 


1233.8813612 
.74 


31/42 


1800.4013893.24 


33/42 


38/42 


Breast 
tumor 
Metastases 


1912.5414625 
.85 


14/24 


3329.50110820.71 


14/24 


17/24 


Benign 
Breast 
disorders 


121.87+78.63 


3/3 


524.661609.43 


2/3 


3/3 


"XT 1 

Normal 
breast 


114.19±94.40 


11/11 
11/11 


517.64+376.83 


8/9 


11/11 
11/11 


Breast 
reQucuon 


231.50±276.6 

Q 
O 


213 


482.541680.28 


1/ 

72 


LID 


uiner 
tumors 


A 1 -34.0 tic 


1 


Z4.1 /XJO.UU 






Salivary 

nrlofiH 


435.65±705.1 

1 
1 


2/3 


45766.61144342.43 


3/3 




Skin 


415.74±376.1 
4 


7/9 


7039.0517774.24 


9/9 




Esophagus 


4.45±3.86 


2/3 


1.0210.14 


0/3 




Bronchia 


0.16 


0/1 


84.44153.31 


2/2 




Other 
normal 
tissues 


0.3311.07 


0/85 


5.49110.65 


3/75 





5 The breast-specific gene B511S, while having a different profile of 

reactivity on breast tumors and normal breast tissue to mammaglobin, reacted with the 
same subset of normal tissues as mammaglobin. B51 IS by PSORT analysis is indicated to 
have an ORF of 90aa and to be a secreted protein as is the case for mammaglobin. B51 IS 
has no evidence of a transmembrane domain but may harbor a cleavable signal sequence. 
10 Mammaglobin detected 14/24 of distant metastatic breast tumors, 31/42 breast tumors and 
exhibited ten-fold over-expression in tumors and metastases as compared to normal breast 
tissue. There was at least 300-fold over-expression in normal breast tissue versus other 
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negative normal tissues and tumors tested, which were essentially negative for 
mammaglobin expression. B511S detected 33/42 breast tumors and 14/24 distant 
metastases, while a combination of B51 IS with mammaglobin would be predicted to detect 
38/42 breast tumors and 17/24 metastatic lesions (Table 2 above). The quantitative level of 
expression of B511S and mammaglobin were also in similar ranges, in concordance with 
the microarray profiles observed for these two genes. Other genes that were additive with 
mammaglobin are shown in Table 3. 
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Table 3 

mRNA Complementation of Mammaglobin with Other Genes 





Mammaglobin 
Positive 


Mammaglobin Negative 






B305D 


GABAw 


B726P 


B305D + 
GABAu 


B305D + GABAti 
+ B726P 


Breast 

Metastases 


13/21 


2/8 


5/8 


3/8 


7/8 


8/8 


Breast 
tumors 


18/25 


3/7 


4/7 


5/7 


7/7 


7/7 



B305D was shown to be highly over-expressed in breast tumors, prostate 

5 tumors, normal prostate tissue and testis compared to normal tissues, including normal 
breast tissue. Different splice variants of B305D have been identified with form A and C 
being the most abundant but all tested have similar tissue profiles in quantitative PGR. The 
A and C forms contain ORF's of 320 and 385 aa, respectively. B305D is predicted by 
PSORT to be a Type II membrane protein that comprises a series of ankyrin repeats. A 

10 known gene shown to be complementary with B305D, in breast tumors, was GABAti. This 
gene is a member of the GABAa receptor family and encodes a protein that has 30-40% 
amino acid homology with other family members, and has been shown by Northern blot 
analysis to be over-expressed in lung, thymus and prostate at low levels and highly over- 
expressed in uterus. Its expression in breast tissue has not been previously described. This 

15 is in contrast to other GABAa receptors that have appreciable expression m neuronal 
tissues. Tissue expression profiling of this gene showed it to be over-expressed in breast 
tumors in an inverse relationship to the B305D gene (Table 3). GABAtc detected 15/25 
tumors and 6/21 metastases including 4 tumors and 5 metastases missed by mammaglobin. 
In contrast, B305D detected 13/25 breast tumors and 8/21 metastases, again including 3 

20 tumors and 2 metastases missed by mammaglobin. A combination of just B305D and the 
GABAtt would be predicted to identify 22/25 breast tumors and 14/21 metastases. The 
combination of B305D and GABAtt with mammaglobin in detecting breast metastases is 
shown in Table 3 above and Figures. 3A and 3B. This combination detected 20/21 of the 
breast metastases as well as 25/25 breast tumors that were evaluated on the same panels for 
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all three genes. The one breast metastasis that was negative for these three genes was 
strongly positive for B726P (Figs. 3 A and 3B). 

To evaluate the presence of circulating tumor cells, an immunocapture (cell 
capture) method was employed to first enrich for epithelial cells prior to RT-PCR analysis. 
5 Immunomagnetic polystyrene beads coated with specific monoclonal antibodies to two 
glycoproteins on the surface of epithelial cells were used for this purpose. Such an 
enrichment procedure increased the sensitivity of detection (-100 fold) as compared to 
direct isolation of poly A"^ RNA, as shown in Table 4. 
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Table 4 

Extraction of Mammaglobin Positive Cells (MB415) Spiked 
into Whole Blood and Detection by Real-time PGR 



MB415 cells/ml Blood 


Epithelial cell extraction 
(Poly RNA) 


Direct Extraction 
(Poly A^ RNA) 




Copies Mammaglobin/ng P-Actin 


100000 


54303.2 


58527.1 


10000 


45761.9 


925.9 


1000 


15421.2 


61.6 


100 


368.0 


5.1 


10 


282.0 


1.1 


1 


110.2 


0 


0 


0 


0 



5 Mammaglobin-positive cells (MB415) were spiked into whole blood at 

various concentrations and then extracted using either epithelial cell enrichment or direct 
isolation from blood. Using enrichment procedures, mammaglobin mRNA was found to be 
detectable at much lower levels than when direct isolation was used. Whole blood samples 
from patients with metatastic breast cancer were subsequently treated with the 

10 immunomagnetic beads. Poly A"^ RNA was then isolated, cDNA prepared and run in 
quantitative PGR using two gene specific primers (Table 1) and a fluorescent probe 
(Taqman"^"), As observed in breast cancer tissues, complementation was also seen in the 
detection of circulating tumor cells derived from breast cancers. Again, mammaglobin 
PGR detected circulating tumor cells in a high percentage of blood samples, albeit at low 

15 levels, from metastatic breast cancer patients (20/32) when compared to the normal blood 
samples (Table 5) but several of the other genes tested to date ftirther increased this 
detection rate. This included B726P, B305D, B311D, B533S and GABAti. The detection 
level of mammaglobin in blood samples from metastatic breast cancer patients is higher 
than described previously (62 vs. 49%), despite testing smaller blood volumes, probably 

20 because of the use of epithelial marker-specific enrichment in our study. A combination of 
all the genes tested indicate that 27/32 samples were positive by one or more of these 
genes. 
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Table 5 

Gene Complementation in Epithelial Cells Isolated from Blood of Normal Individuals and 

Metastatic Breast Cancer Patients ^ 



Sample ID 


Mammaglobin 


B305D 


B311D 


B533S 


B726P 


GABAti 


Combo 


2 




- 


- 


+ 




- 


4- 


3 


+ 


- 


- 




- 




+ 


5 


+ 


+ 




- 


+ 




+ 


6 




- 


+ 


+ 


+ 




4- 


8 


- 


- 


+ 


- 






+ 


9 




-f- 


+ 


- 


+ 




+ 


10 


+ 


- 




- 


+ 




+ 


11 


- 


- 


- 


- 








12 


+ 


+ 


+ 


- 








13 


- 


- 


- 


+ 






+ 


15 


- 


- 


- 


- 


- 






18 


+ 


- 


- 


- 


- 






19 


+ 


- 


- 


- 


- 


+ 




21 




- 


- 


- 


- 


- 


+ 


22 


- 


- 


- 


- 


- 


- 


- 


23 


+ 


- 


- 


- 


- 


- 


4- 


24 




- 


- 


- 


- 


- 


4- 


25 


- 


+ 


- 


- 


- 


- 


4- 


26 


- 


- 


- 


- 


- 


- 


- 


29 


+ 


- 


+ 




+ 






31 


+ 


- 


- 




- 


- 


+ 


32 


- 


- 


- 


- 


- 


± 


± 


33 


- 


- 


- 


- 




- 


4- 


34 


+ 


- 


- 


+ 


- 


+ 


4- 


35 


+ 


- 


- 


- 




- 


4- 


36 


- 


- 


- 


- 


- 


+ 


+ 


37 


+ 


- 


- 


+ 


- 




4- 


38 


- 


- 


- 


- 


- 






40 


+ 


- 


- 


- 


- 




+ 


41 


+ 


- 


- 


+ 


- 


- 


4- 


42 


+ 


- 


- 


- 


- 


- 


+ 


43 


- 


- 


- 


- 


- 


+ 


4- 


Donor 104 


- 


- 


- 


- 


- 


+ 


+ 


Donor 348 


- 


- 


- 


- 


- 


Nd 




Donor 392 


- 


- 


- 


- 


- 


Nd 


- 


Donor 408 












Nd 




Donor 244 
















Donor 355 
















Donor 264 
















Donor 232 












Nd 




Donor 12 
















Donor 415 












Nd 




Donor 35 
















Sensitivity 


20/32 


4/32 


7/32 


9/32 


7/32 


4/32 


27/32 
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In further studies, mammaglobin, GABAtt, B305D (C form) and B726P 
specific primers and specific Taqman probes were employed in different combinations to 
analyze their combined mRNA expression profile in breast metastases (B. met) and breast 
tumor (B. tumors) samples using real-time PGR. The forward and reverse primers and 
5 probes employed for mammaglobin, B305D (C form) and B726P are shown in Table 1. 
The forward primer and probe employed for GABAti are shown in Table 1, with the 
reverse primer being as follows: TTCAAATATAAGTGAAGAAAAAATTAG- 
TAGATCAA (SEQ id N0:51). As shown below in Table 6, a combination of 
mammaglobin, GABAtt, B305D (C form) and B726P was found to detect 22/22 breast 
1 0 tumor samples, with an increase in expression being seen in 5 samples (indicated by ++), 
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Table 6 



Real-time PCI 


1 Detection of Tumor Samples using Different Primer Combinations 


Tumor sample 


Mammaglobin 


Mammaglobin + 
GABA 


Mammaglobin + 
GABA + B305D 


Mammaglobin + GABA 
+ B305D + B726P 


B. Met316A 




+ 




+ 


B.Met317A 


+ 


4- 


+ 


+ 


B, Met318A 




+ 


+ 




B. Met 595A 


+ 


+ 


+ 


+ 


B. Met 611 A 


+ 


+ 


+ 


+ 


B. Met612A 




+ 


+ 


+ 


B. Met614A 




+ 


+ 


+ 


B.Met616A 




+ 


+ 


+ 


B, Met618A 


+ 


+ 


+ 


+ 


B. Met 620A 


+ 


+ 






B.Met621A 


+ 


+ 


+ 


+ 


B. Met 624A 


+ 


+ 


+ 


+ 


B. Met 625A 






+ 




B. Met 627A 


+ 




+ 


+ 


B. Met 629A 




+ 


+ 


+ 


B. Met 631 A 


+ 


+ 


+ 


+ 


B. Tumor 154 A 


+ 


+ 




++ 


B. Tumor 155A 


+ 


+ 


+ 


++ 


B. Tumor 8 ID 






+ 


++ 


B. Tumor 209A 




+ 




+ 


B. Tumor 208A 




+ 


+ 


++ 


B, Tumor lOA 




+ 




+ 



The increase of message signals by the addition of specific primers was 
5 further demonstrated in a one plate experiment employing the four tumor samples B. met 
316A, B. met 317A, B. tumor 81D and B. tumor 209A. 

Expression of a combination of mammaglobin, GABAti, B305D (C form) 
and B726P in a panel of breast tumor and normal tissue samples was also detected using 
real-time PGR with a SYBR Green detection system instead of the Taqman probe 
10 approach. The results obtained using this system are shown in Figure 7. 

EXAMPLE 7 

Quantitative PGR in Peripheral Blood of Breast Ganger Patients 
The known genes evaluated in this study were mammaglobin and y 
15 aminobutyrate type A receptor n subunit (GABAti). In order to identify novel genes which 
are over-expressed in breast cancer we have used an improved version of the differential 
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display RT-PCR (DDPCR) technique (Liang et al., Science 257:967-971 (1993); Mou et 
al., Biochem Biophy Res Commun. 199:564-569 (1994)); cDNA library extraction 
methods (Kara et al., Blood 84:189-199 (1994)) and PGR subtraction (Diatchenko et al., 
Proc Natl Acad Sex USA., 93:6025-6030 (1996); Yang et al., Nucleic Acids Res. 27:1517- 
5 23 (1999)). 

Differential display resulted in the recovery of two cDNA fragments 
designated as B305D and B311D (Houghton et al., Cancer Res. 40:Abstract #217, 32-33, 
(1999). B511S and B533S are two cDNA fragments isolated using cDNA library 
subtraction approach (manuscript in preparation) while the B726P cDNA fragment was 

10 derived from PGR subtraction (Jiang et al.. Proceedings of the Amer Assoc Gancer Res. 
40:Abstract #216, 32 (1999); Xu et al., Proceedings of the Amer Assoc Gancer Res. 
40: Abstract #2115, 319 (1999); and Molesh et al.. Proceedings of Amer Assoc Gancer Res. 
41: Abstract #4330, 681 (2000). 

Three of the novel genes, B31 ID, B533S and B726P, showed identical 

15 breast tissue expression profile by quantitative PGR analysis. These genes were evaluated 
in quantitative PGR on two different panels consisting of (a) breast tumor, breast normal 
and normal tissues and (b) panel of breast tumor metastases (primarily lymph nodes). 
Primers and probes used are shown in Table 1. The data for panel (a) is shown in Figure 2 
for all three genes. Overall, the expression profiles are comparable and are in the same rank 

20 order, however, the levels of expression are considerably different. B31 ID in general was 
expressed at lower levels than B533S and both less than B726P but all three were resfricted 
to breast tissue. All three sequences were used to search against the Genbank database. 
Both B311D and B533S sequences contain different repetitive sequences and an ORF has 
not been identified for either. B726P is a novel gene, with mRNA splicing yielding several 

25 different putative ORF's. 

The quantitative PGR confirmed there was a differential mRNA expression 
between normal breast tissue and breast tumors, with approximately 50% of breast tumors 
overexpressed these genes. When tested on a panel of distant metastases derived from 
breast cancers all three genes reacted with 14/21 metastases and presented similar profiles 
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(data not shown). Interestingly, when tested on a prostate cancer panel, all three genes 
identified the same 3/24 prostate tumors but at much lower expression levels than in breast. 
This group of genes exhibited increasing levels of expression as a function of pathological 
stage of the tumor as shown for B533S. 
5 More rigorous analysis of mammaglobin gene expression by microarray, 

and quantitative PGR showed expression at significant levels in skin and salivary gland and 
much lower levels in esophagus and trachea. B511S had a slightly different profile of 
reactivity on breast tumors and normal breast tissue when compared to mammaglobin, yet 
reacted with a similar subset of normal tissues as mammaglobin. Mammaglobin detected 

10 14/24 of distant metastatic breast tumors, 31/42 breast tumors and exhibited ten-fold over- 
expression in tumors and metastases as compared to normal breast tissue. There was at 
least 300-fold over-expression of mammaglobin in normal breast tissue versus other 
negative normal tissues and tumors tested. B511S detected 33/42 breast tumors and 14/24 
distant metastases. A combination of B511S with mammaglobin would be predicted to 

15 detect 38/42 breast tumors and 17/24 metastatic lesions. The quantitative level of 
expression of B511S and mammaglobin were also in similar ranges, in concordance with 
the microarray profiles observed for these two genes. 

Certain genes complemented mammglobin's expression profile, i.e. were 
shown to express in tumors that mammaglobin did not. B305D was highly over-expressed 

20 in breast tumors, prostate tumors, normal prostate tissue and testis compared to normal 
tissues including normal breast tissue. Different splice variants of B305D were identified 
with the forms A and C being the most abundant. All forms tested had similar tissue 
profiles in quantitative PGR. The A and G forms contain ORF's of 320 and 385 aa, 
respectively. A known gene shown to be complementary with B305D, in breast tumors, 

25 was GABAtt. This tissue expression profile is in contrast to other GABAa receptors that 
typically have appreciable expression in neuronal tissues. An additional observation was 
that tissue expression profiling of this gene showed it to be over-expressed in breast tumors 
m an inverse relationship to the B305D gene (Table 3). GABAtc detected 15/25 tumors and 
6/21 metastases including 4 tumors and 5 metastases missed by mammaglobin. In contrast, 
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B305D detected 13/25 breast tumors and 8/21 metastases again including 3 tumors and 2 
metastases missed by mammaglobin. A combination of just B305D and the GABAtt would 
be predicted to identify 22/25 breast tumors and 14/21 metastases. This combination 
detected 20/21 of the breast metastases as well as 25/25 breast tumors that were evaluated 
5 on the same panels for all three genes. The one breast metastasis that was negative for 
these three genes was strongly positive for B726P. 

The use of microarray analysis followed by quantitative PGR provided a 
methodology to accurately determine the expression of breast cancer genes both in breast 
tissues (tumor and normal) as well as in normal tissues and to assess their diagnostic and 

10 therapeutic potential. Five novel genes and two known genes were evaluated using these 
techniques. Three of these genes B31 ID, B533S and B726P exhibited concordant mRNA 
expression and collectively the data is consistent with coordinated expression of these three 
loci at the level of transcription control All three genes showed differential expression in 
breast tumors versus normal breast tissue and the level of overexpression appeared related 

15 to the pathological stage of the tumor. In the case of mammaglobin, expression was found 
in other tissues apart from breast tissue. Expression was seen in skin, salivary gland and to 
a much lesser degree in trachea. 

Expression of GABAti in breast tumors was also a novel observation. While 
the expression of several genes complemented that seen with mammaglobin, two genes in 

20 particular, B305D and GABAti; added to the diagnostic sensitivity of mammaglobin 
detection. A combination of these three genes detected 45/46 (97.8%) breast tumors and 
metastases evaluated. Inclusion of B726P enabled the detection of all 25 of the breast 
tumors and 21 distant metastases. 



25 EXAMPLE 8 

Enrichment of Circulating Breast Cancer Cells by Immunocapture 

This example discloses the enhanced sensitivity achieved by use of the 
immunocapture cell capture methodology for enrichment of circulating breast cancer cells. 
To evaluate the presence of circulating tumor cells an immunocapture 
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method was adopted to first enrich for epithelial cells prior to RT-PCR analysis. Epithelial 
cells were enriched from blood samples with an immunomagnetic bead separation method 
(Dynal A.S, Oslo, Norway) utilizing magnetic beads coated with monoclonal antibodies 
specific for glycopolypeptide antigens on the surface of human epithelial cells. 

5 (Exemplary suitable cell-surface antigens are described, for example, in Momburg, F, et 
al, Cancer Res,, 41:2883-91 (1997); Naume, B. et al., Journal of Hemotherapy. 6:103-1 13 
(1997); Naume, B. et al., Int J Cancer. 78:556-60 (1998); Martin, V.M. et al., Exp 
HematoL, 26:252-64 (1998); Hildebrandt, M. et al, Exp Hematol 25:57-65 (1997); Eaton, 
M.C. et al., Biotechniques 22:100-5 (1997); Brandt, B. et al., Clin Exp Metastases 14:399- 

10 408 (1996), each of which are mcorporated herein by reference in their entirety. Cells 
isolated this way were lysed and the magnetic beads removed. The lysate was then 
processed for poly A"^ mRNA isolation using magnetic beads (Dynabeads) coated with 
Oligo (dT) 25. After washing the beads in the kit buffer bead/poly A'^RNA samples were 
finally suspended in lOmM Tris HCl pH 8 and subjected to reverse transcription. The 

15 RNA was then subjected to Real time PCR using gene specific primers and probes with 
reaction conditions as outlined herein above. p-Actin content was also determined and 
used for normalization. Samples with gene of interest copies/ng p-actin greater than the 
mean of the normal samples + 3 standard deviations were considered positive. Real time 
PCR on blood samples was performed exclusively using the Taqman^"^ procedure but 

20 extending to 50 cycles. 

Mammaglobin mRNA using enrichment procedures was found to be 
detectable at much lower levels than when direct isolation was used. Whole blood samples 
from patients with metatastic breast cancer were subsequently treated with the 
immunomagnetic beads, polyA"^ RNA was then isolated, cDNA made and run in 

25 quantitative PCR using two gene specific primers to mammaglobin and a fluorescent probe 
(Taqman™). As observed in breast cancer tissues, complementation was also seen in the 
detection of circulating tumor cells derived firom breast cancers. Again, mammaglobin 
PCR detected circulating tumor cells in a high percentage of bloods, albeit at low levels, 
from metastatic breast cancer (20/32) when compared to the normal blood samples. 
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Several of the other genes tested to date could further increase this detection rate; tiiis 
includes B726P, B305D, B311D, B533S and GABAti. A combination of all the genes 
tested indicates that 27/32 samples were positive by one or more of these genes. 

5 EXAMPLE 9 

Multiplex Detection of Breast Tumors 
Additional Multiplex Real-time PGR assays were established in order to 
simultaneously detect the expression of four breast cancer-specific genes: LipophilinB, 
Gaba (B899P), B305D-C and B726P. In contrast to detection approaches relying on 
10 expression analysis of suigle breast cancer-specific genes, this Multiplex assay was able to 
detect all breast tumor samples tested. 

This Multiplex assay was designed to detect LipophilinB expression instead 
of Mammaglobin. Due to their similar expression profiles, LipophilinB can replace 
Mamm^lobin in this Multiplex PGR assay for breast cancer detection. The assay was 
15 carried out as follows: LipophilinB, B899P (Gaba), B305D, and B726P specific primers, 
and specific Taqman probes, were used to analyze their combined mRNA expression 
profile in breast tumors. The primers and probes are shown below: 

LipophilinB: Forward Primer (SEQ ID NO: 33): 5' 
TGCCGGTGCGGAAGGT. Reverse Primer (SEQ ID NO:34): 5' 
20 GGTTTGTGAAGGGAGATGTGATC. Probe (SEQ ID NO: 35) (FAM-5' - 3'-TAMRA): 
TTGCAGCCAAGTTAGGAGTGAAGAGATGGA. 

GABA (B899P): Forward Primer (SEQ ID NO: 36): 5' 
AAGCGTGAGAGTCGTTGCAGTATG. Reverse Primer (SEQ ID NO: 37): 5' 
TTGAAATATAAGTGAAGAAAAAATTAGTAGATCAA. Probe (SEQ ID NO: 38) 
25 (FAM-5' - 3 '-TAMRA): AATCGATTGTATGTTAGAAGGGAGGGATTTGTTTAGA. 

B305D (C form): Forward Primer (SEQ ID NO: 39): 5' 
AAAGCAGATGGTGGTTGAGGTT. Reverse Primer (SEQ ID NO: 40): 5' 
CCTGAGACCAAATGGCTTCTTG. Probe (SEQ ID NO: 41) (FAM-5' - 3'-TAMRA) 
ATTGCATGGGGGCTGCTTGTTCTG. 
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B726P: Forward Primer (SEQ ID NO: 42): 5' 
TCTGGTTTTCTCATTCTTTATTCATTTATT. Reverse Primer (SEQ ID NO: 43): 5' 
TGCCAAGGAGCGGATTATCT. Probe (SEQ ID NO: 44) (FAM-5' - 3'-TAMRA): 
CAACCACGTGACAAACACTGGAATTACAGG. 
5 Actin: Forward Primer (SEQ ID NO: 45): 5' 

ACTGGAACGGTGAAGGTGACA. Reverse Primer (SEQ ID NO 46): 5' 
CGGCCACATTGTGAACTTTG. Probe (SEQ ID NO: 47): (FAM-5' - 3'-TAMRA): 
CAGTCGGTTGGAGCGAGCATCCC. 

The assay conditions were: 
10 Taaman protocol (7700 PerMn Elmer) : 

In 25 [il final volume: Ix Buffer A, 5mM MgCl, 0.2 mM dCTP, 0.2 mM 
dATP, 0.4 mM dUTP, 0.2 mM dGTP, 0.01 U/^il AmpErase UNG, 0.025 u/^1 TaqGold, 8% 
(v/v) Glycerol, 0.05% (v/v) Gelatin, 0.01% (v/v) Tween20, 4 pmol of each gene specific 
Taqman probe (LipophilinB + Gaba + B305D + B726P), 100 nM of B726P-F + B726P-R, 
15 300 nM of Gaba-R, and 50 nM of LipophilinB-F + LipophiUnB-R + B305D-R + Gaba-R, 
template cDNA (originating from 0.02 |ag polyA + RNA). 

LipophilinB expression was detected in 14 out of 27 breast tumor samples. 
However, the Multiplex assay for LipophilinB, B899P, B305D-C and B726P detected an 
expression signal in 27 out of 27 tumors with the detection level above 10 mRNA 
20 copies/1000 pg actin in the majority of samples and above 100 mRNA copies/1000 pg actin 
in 5 out of the 27 samples tested (Figure 8). 

EXAMPLE 10 
Multiplex Detection Optimization 
25 The Multiplex Real-time PGR assay described above was used to detect the 

expression of Mammaglobin (or LipophilinB), Gaba (B899P), B305D-C and B726P 
simultaneously. According to this Example, assay conditions and primer sequences were 
optimized to achieve parallel amplification of four PGR products with different lengths. 
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Positive samples of this assay can be further characterized by gel electrophoresis and the 
expressed gene(s) of interest can be determined according to the detected amplicon size(s). 

Mammaglobin (or LipophilinB), Gaba (B899P), B305D and B726P specific 
primers and specific Taqman probes were used to simultaneously detect their expression. 
5 The primers and probes used in this example are shown below. 

Mammaglobin: Forward Primer (SEQ ID NO: 48): 5' 
TGCCATAGATGAATTGAAGGAATG. Reverse Primer (SEQ ID NO: 49): 5' 
TGTCATATATTAATTGCATAAACACCTCA. Probe (SEQ ID NO: 50): (FAM-5' - 3'- 
TAMRA):TCTTAACCAAACGGATGAAACTCTGAGCAATG. 
10 GABA (B899P): Forward Primer (SEQ ID NO: 36): 5' 

AAGCCTCAGAGTCCTTCCAGTATG. Reverse Primer (SEQ ID NO: 51): 5' 
ATCATTGAAAATTCAAATATAAGTGAAG. Probe (SEQ ID NO: 38) (FAM-5' - 3'- 
TAMRA)AATCCATTGTATCTTAGAACCGAGGGATTTGTTTAGA. 

B305D (C form): Forward Primer (SEQ ID NO: 39): 5' 
15 AAAGCAGATGGTGGTTGAGGTT. Reverse Primer (SEQ ID NO: 40): 5' 
CCTGAGACCAAATGGCTTCTTC. Probe (SEQ ID NO: 41): (FAM-5' - 3'-TAMRA): 
ATTCCATGCCGGCTGCTTCTTCTG. 

B726P: Forward Primer (SEQ ID NO: 52): 5' 
GTAGTTGTGCATTGAAATAATTATCATTAT. Reverse Primer (SEQ ID NO: 43): 5' 
20 TGCCAAGGAGCGGATTATCT. Probe (SEQ ID NO: 44) (FAM-5' - 3'-TAMRA): 
CAACCACGTGACAAACACTGGAATTACAGG. 

Primer locations and assay conditions were optimized to achieve parallel 
amplification of four PGR products with different sizes. The assay conditions were: 

Taqman protocol (7700 Perkin Elmer) : 
25 In 25 ^1 fmal volume: Ix Buffer A, 5 mM MgCl, 0.2 mM dCTP, 0.2 mM 

dATP, 0.4 mM dUTP, 0.2 mM dGTP, 0.01 U/^il AmpErase UNG, 0.0375 U/^il TaqGold, 
8% (v/v) Glycerol, 0.05% (v/v) Gelatin, 0.01% (v/v) Tween20, 4 pmol of each gene 
specific Taqman probe (Mammaglobin + Gaba + B305D + B726P), 300 nM of Gaba-R + 
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Gaba-F, 100 nM of Mammaglobin-F + R; B726P-F + R, and 50 nM of B305D-F + R 
template cDNA (originating from 0.02 (|ig polyA + RNA). 
PCR protocol : 

S(f for 2': x 1, 95^ for 10^: X 1, and 95° for 15" / 60° for 1' / 68 ° for 1': x 

5 50. 

Since each primer set in the muhiplex assay results in a band of unique 
length, expression signals of the four genes of interest can be measured individually by 
agarose gel analysis {see^ Figure 9), or the combined expression signal of all four genes can 
be measured in real-time on an ABI 7700 Prism sequence detection system (PE 

10 Biosystems, Foster City, CA). The expression of LipophilinB can also be detected instead 
of Mammaglobin. Although specific primers have been described herein, different primer 
sequences, different primer or probe labeling and different detection systems could be used 
to perform this multiplex assay. For example, a second fluorogenic reporter dye could be 
incorporated for parallel detection of a reference gene by real-time PCR. Or, for example a 

1 5 S YBR Green detection system could be used instead of the Taqman probe approach. 



EXAMPLE 11 

Design and Use of Genomic DNA-Excluding, Intron-Exon Border Spanning 
Primer Rairs for Breast Cancer Multiplex Assay 

20 The Multiplex Real-time PCR assay described herein can detect the 

expression of Mammaglobin, Gaba (B899P), B305D-C and B726P simultaneously. The 
combined expression levels of these genes is measured in real-time on an ABI 7700 Prism 
sequence detection system (PE Biosystems, Foster City, CA). Individually expressed 
genes can also be identified due to different amplicon sizes via gel electrophoresis. In 

25 order to use this assay with samples derived from non-DNase treated RNAs {e.g. lymph 
node cDNA) and to avoid DNase-treatment for small RNA-samples (e.g. from blood 
specimens, tumor and lymph node aspirates), intron-spaiming primer pairs have been 
designed to exclude the amplification of genomic DNA and therefore to eliminate 
nonspecific and false positive signals. False positive signal is caused by genomic DNA 
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contamination in cDNA specimens. The optimized Multiplex assay described herein 
excludes the amplification of genomic DNA and allows specific detection of target gene 
expression without the necessity of prior DNase treatment of RNA samples. Moreover the 
genomic match and the location of the Intron-Exon border could be verified with these 
5 primer sets. 

Mammaglobin, Gaba (B899P), B305D and B726P specific primers and 
specific Taqman probes were used to simultaneously detect their expression (Table 7). 
Primer locations were optimized (Intron-Exon border spanning) to exclusively detect 
cDNA and to exclude genomic DNA from amplification. The identity of the expressed 
1 0 gene(s) was determined by gel electrophoresis. 



Table 7 



Intron-Exon border Spanning Primer and Probe Sequences 
for Breast Tumor Multiples Assay 



Gene 


Forward Primer 


Reverse Primer 


Taqman probe 
(FAM-5'-3'TAMRA) 


Mammaglobin 


tgccatagatgaattgaagga 
atg (SEQ ID NO:48) 


tgtcatatattaattgcataaacacct 
ca (SEQIDNO:49) 


tcttaaccaaacggatgaaactctgagca 
atg (SEQIDNO:50) 


B899P 


aagcctcagagtccttccagta 
tg(SEQ IDNO:36) 


ttcaaatataagtgaagaaaaaatta 
gtagatcaa (SEQ ID 
NO:37) 


aatccattgtatcttagaaccgagggattt 

gttt (SEQ ID NO:62) 


B305D 


aaagcagatggtggttgaggt 
t(SEQ IDNO:39) 


cctgagaccaaatggcttcttc 
(SEQ ID NO:40) 


attccatgccggctgcttcttctg (SEQ 
ID NO:41) 


B726P 


tctggttttctcattctttattcatt 
tatt(SEO IDNO:42) 


tgccaaggagcggattatct 
(SEQIDNO:43) 


caaccacgtgacaaacactggaattaca 

gg (SEQ ID NO:44) 


Actin 


actggaacggtgaaggtgac 
a (SEQ IDNO:45 


cggccacattgtgaactttg 
(SEQ IDNO:46) 


cagtcggttggagcgagcatccc 
(SEQ IDNO:47) 


B899P-INT 


caattttggtggagaacccg 
(SEQ ID NO:53) 


gctgtcggaggtatatggtg 
(SEQ IDNO:54) 


catttcagagagtaacatggactacaca 
(SEQ ID NO:55) 


B305D-INT 


tctgataaaggccgtacaatg 
(SEQ ID NO:56) 


tcacgacttgctgtttttgctc 
(SEQ ID NO:57) 


atcaaaaaacaagcatggcctcacacca 
ct (SEQ ID NO:58) 


B726P-INT 


gcaagtgccaatgatcagagg 
(SEQ IDNO:59) 


atatagactcaggtatacacact 
(SEQ IDNO:60) 


tcccatcagaatccaaacaagaggaaga 
tg(SEQ IDN0:61) 



Primer locations and assay conditions were optimized to achieve parallel 
amplification of the four PGR products. The assay conditions were as follows: 
Taaman protocol (7700 PerUn Elmer) 

In 25^1 final volume: Ix Buffer A, 5 mM MgCl, 0.2 mM dCTP, 0.2 mM 
20 dATP, 0.4 mM dUTP, 0.2 mM dGTP, 0.01 U/AmpErase UNG, 8 % (v/v) Glycerol, 0.05 % 
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(v/v) Gelatin, 0.01 % (v/v) TweenlO, 4 pmol of each gene specific Taqman probe 
(Mammaglobin + B899P-INT + B305D-INT + B726P-INT), 300 nM of B305D-INT-F; 
B899P-INT-F, 100 nM of Mammaglobin-F + R; B726P-INT-F 50 nM of B899P-INT- 
R; B305D-INT-R, template cDNA (originating from 0.02 pig polyA+ RNA). 
5 PCR cycling conditions 

1 cycle at 50°C for 2 minutes, 1 cycle at 95*^C for 10 minutes, 50 cycles of 
95'*C for 1 minute and 68*^0 for 1 minute. 

Figure 10 shows a comparison of the multiplex assay using intron-exon 
border spanning primers (bottom panel) and the multiplex assay using non-optimized 
10 primers (top panel), to detect breast cancer cells in a panel of lymph node tissues. This 
experiment shows that reduction in background resulting from genomic DNA 
contamination in samples is achieved using the intron-exon spanning primers of the present 
invention. 



15 EXAMPLE 12 

Multiplex Detection of Metastasized Breast Tumor Cells in 
Sentinel Lymph Node Biopsy Samples 
Lymph node staging is important for determining appropriate adjuvant 
hormone and chemotherapy. In contrast to conventional axillary dissection a less invasive 

20 approach for staging of minimal residual disease is sentinel lymph node biopsy. Sentinel 
lymph node biopsy (SLNB) has the potential to improve detection of metastases and to 
provide prognostic values to lead to therapy with minimal morbidity associated with 
complete lymph node dissection. SLNB implements mapping of the one or two lymph 
nodes which primarily drain the tumor and therefore are most likely to harbor metastatic 

25 disease (the sentinel nodes). Routine pathological analysis of lymph nodes result in a high 
false-negative rate: one-third of women with pathologically negative lymph nodes develop 
recurrent disease [Bland: The Breast: Saunders 1991], A more sensitive detection technique 
for tumor cells would be RT-PCR but its application is limited by lack of a single specific 
markers. The multimarker assay described above increases the likelihood of cancer 
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detection across the population without producing false-positive results from normal lymph 
nodes. 

As mentioned above, lymphatic afferents from a primary tumor drain into a 
single node, the sentinel lymph node, before drainage into the regional lymphatic basin 
5 occurs. Sentinel lymph nodes are located with dyes and/or radiolabeled colloid injected in 
the primary lesion site and sentinel lymph node biopsy allows pathological examination for 
micrometastatic deposits, staging of the axilla and therefore can avoid imnecessary axillary 
dissection. Nodal micrometastases can be located with staining (haematoxylin or eosin) or 
immunohistochemical analysis for cytokeratin proteins. Immunocytochemical staining 
1 0 techniques can produce frequent false-negative results by missing small metastatic foci due 
to inadequate sectioning of the node. Immunohistochemistry can result in false-positive 
results due to illegitimate expression of cytokeratins (reticulum cells) or in false-negative 
results when using the antibody Ber-Ep4 which corresponding antigen is not expressed on 
all tumor cells. 

15 The multiplex assay described herein could provide a more sensitive 

detection tool for positive sentinel lymph nodes. Moreover the detection of breast cancer 
cells in bone marrow samples, peripheral blood and small needle aspiration samples is 
desirable for diagnosis and prognosis in breast cancer patients. 

Twenty-two metastatic lymph node samples, in addition to 15 samples also 

20 previously analyzed and shown in Figure 3A, were analyzed using the intron-exon border 
spanning multiplex PGR assay described herein. The results from this analysis are 
summarized in Table 8, Twenty-seven primary tumors were also analyzed and the results 
shown in Table 9. Twenty normal lymph node samples tested using this assay were all 
negative. 

25 
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Table 8. 

Multiples Real-time PCR Analysis of 37 Metastatic Lymph Nodes 



breast metastatic 
lymph node samples 


Mammaglobin 


B305D 


B899P 


B726P 


Multiplex 


B.Met317A 


++ 


+ 




+ 




B.Met318A 






-H- 




+++ 


B.Met595A 


+ 






+ 


+-H- 


B.Met611A 


+ 


+ 


+++ 




+4- 


B.Met612A 


++ 


++ 




+ 


++ 


B.Met614A 




++ 




++ 


+-H- 


B.Met616A 






+ 






B.Met618A 


+++ 


+ 






+-H- 


B.Met620A 


++ 


++ 




++ 


+++ 


B.Met 621 A 


+ 


+++ 




+ 


+-H- 


B.Met 624A 






++ 




+++ 


B.Met625A 




++ 






+ 


B.Met 627A 




+ 




+ 


+ 


B.Met 629A 


++ 








+++ 


B.Met 631 A 


+ 








+ 


1255 


+++ 






++ 




1257 


+++ 




+ 


+ 


++ 


769 


+++ 






+ 


++ 


1258 


++ 


+ 


+ 




+ 


1259 




++ 






+++ 


1250 


+++ 


+ 




+ 


+++ 


1726 


4-H- 








-H-+ 


786 


+-H- 




+ 




+++ 


281-LI-r 


+++ 








+++ 


289-L2 


-H- 


+ 






++ 


366-S 


+ 








+ 


374.S+ 


-H-+ 


■H- 






+++ 


376-S 


++ 








++ 


381-S 


+ 


+ 






+ 


383-Sx 


+44- 


++ 






+++ 


496-M 




++ 






+++ 


591-SI-A 


+ 


+ 






+ 


652-1 






-H- 




+++ 


772 


+ 








+ 


777 


+ 






-H- 


++ 


778 










+++ 


779 






++ 




++ 
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Table 9 



Multinlex Rea 


-time PGR Analysis of 27 Primary Breast Tumors 


uicddi pziuidry lumor 
samples 


Matnmaglobin 


B305D 


B899P 


B726P 


Multiplex 


T443 


+ 


++ 




+++ 


+++ 


T457 




+ 


+ 




++ 


T395 






++ 




++ 


TlOA 


+ 


+++ 




+++ 


+++ 


T446 




+ 




++ 


++ 


TllC 






+++ 




-H-+ 


T23B 


+ 








+++ 


T207A 




++ 






+ 


T437 




4- 




++ 


+++ 


T391 


+ 


-H- 




+++ 


+++ 


T392 


+ 


+ 






-H- 


TS76 




++ 






+++ 


T483 










+++ 


T8iG 


+ 




++ 


++ 


+++ 


T430 


+ 




++ 




-H- 


T465 


+ 


-1- 




+ 


-H- 


TS80 






+ 




+ 


T469 


+ 






+ 


+++ 


T467 


+ 






++ 




T439 




+ 






+ 


1 JO/ 


++ 




+ 


+ 


++ 


T318 






+ 




++ 


T154A 








+ 




T387A 


+++ 




+ 




+++ 


T155A 


+ 




++ 




+ 


T209A 










++ 


T208A 




+ 






++ 



5 EXAMPLE 13 

Detection of Circulating Breast Tumor Cells in Peripheral Blood and Bone 
Marrow Using a Multiplex Real-Time PGR Assay 

Circulating breast tumor cells were detected in peripheral blood and bone 
10 marrow by either epithelial cell capture essentially as described in Example 8 and below or 
RosetteSep CD45 depletion followed by real-time multiplex PCR analysis. RosetteSep 
was used to enrich cells directly from a blood sample and consisted of a cocktail of 
tetrameric antibodies that target a variety of unwanted cells and crosslink them to 
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glycophorin A on red blood cells (RBC) present in the sample, forming rosettes. When 
centrifuged over Ficoil, targeted cells pelleted along with the free RBC. 

Three breast cancer cell lines were used for blood spiking experiments in 
different concentrations. All three cell lines (SKBR3, BT474, MB415) express 
5 Manmiaglobin and therefore could be detected by single gene Mammaglobin RT-PCR. 

Blood spiking and epithelial cell capture was carried out as follows: 
Peripheral blood from a normal, healthy donor drawn into EDTA containing vacutainers 
was used in 6 ml aliquots and spiked with 2, 20 or 200 cells/ml SKBR3, BT474 or MB415 
cells. Mononuclear cells were isolated using Sigma Diagnostics Accuspin System- 
ic Histopaque-1077 (Sigma, St. Louis, MO) and resuspended m 5 ml PBS (/w 0.1% (w/v) 
BSA and 0.6% (g/L) Na-Citrate). For bone marrow samples and mononuclear cells, 
epithelial cell capture was performed using 100 ul of Dynabeads Epithelial Enrich (Dynal, 
Oslo, Norway) at 4^C for 30 minutes. Dynabeads mRNA DIRECT Micro Kit was used for 
mRNA isolation and cDNA synthesis. CD45 depletion was carried out as follows: For 
15 blood spiking experiments and for patient samples, RosetteSep Antibody Cocktail 
(StemCell Technologies Inc., Vancouver, BC) was used to deplete CD45 positive cells. 
RNA was isolated using an mRNA isolation kit (F, Hoffinann-La Roche Ltd., Basel, 
Switzerland). The Multiplex real-time PCR assay was carried out as described in Example 
11. 

20 The results from the blood spiking experiments indicated that the 

RosetteSep CD45 depletion prior to multiplex real-time PCR was the most efficient and 
resulted in the most sensitive detection of breast tumor cells in circulation. 

In addition, 42 peripheral blood patient samples from Senegal were tested 
using RosetteSep CD45 depletion followed by multiplex real-time PCR. 23 were found 

25 positive in the multiplex assay (Table 10). 35 bone marrow samples from breast cancer 
patients were tested using epithelial cell capture followed by multiplex real-time PCR. 23 
were found to be positive (Table 1 1). 
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Table 10:RosetteSep CD45 depletion and Multiplex real-time RT-PCR on Breast 

CANCER PATIENT SAMPLES FROM SENEGAL 





Multiplex 

assay 


firene identification 
via agarose gel 


PatholofiTV tissue bioDSV 


Sample 








Tis04 




Gaba 


ductal carcinoma, nomial 
inflammatory 


TislO 


... 


• 


ductal carcinoma, invasive 


Tis22 




Mammaglobin 


inadequate, inflammatory 


Tis25 




B305D+B726P 


inadequate, inflammatory 


Tis29 






lobular carcinoma, invasive 


Tis24 


+++ 


Mammaglobin 


ductal carcinoma, invasive 


Tis23 


— . 




normal inflammatory 


Tis28 




Ambiguous 


ductal carcinoma invasive 


Tis02 


-HH- 


Mammaglobin 


inadequate, inflammatory 


Tis07 


-+ 


B305D 


inadequate, inflammatory 


Tisll 


+++ 


Mammaglobin 


Suspicious 


Tis03 


--H- 


Gaba 


ductal carcinoma, invasive 


Tis08 


+- 


B305D 


ductal carcinoma, invasive 


D157-Normal 







- 


D346-Normal 


— 


- 


- 


D151-Nonnal 








TisOl 






ductal, invasive 


Tis05 






ductal invasive 


Tis06 






normal inflammatory 


Tis09 


+- 


Gaba 


inadequate, hyperplasia 

J. ^ m> IT ± 


Tisl2 






inadequate, inflammatory 


TislS 


„- 




ductal invasive 


Tisl4 


+~ 


B305D 


inadequate, normal fibrous 
tissues 


TislS 






ductal carcinoma 


Tisl6 


-HH- 


Mammaglobin 


ductal carcinoma invasive 


Tisl7 






normal inflammatory 


TisOlS 






ductal carcinoma 


Tis26 


+++ 


Gaba 


ductal carcinoma invasive 


Tis27 




Gaba+Mammaglobin 


lobular carcinoma invasive 
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Trinlicate 
Multiplex 
assay 


Crene identification 
via agarose gel 


Pdtholoc^ ti^mie hinn^v 


D140-Normal 


___ 






Tis32 


++- 


Mammaglobin+Gaba 


low grade ductal carcinoma 


Tis34 


++. 


Mammaglobin-Kjaba 


+ malignancy 


Tis38 





_ 


+ malignancy 


Tis40 


++- 


B726P 


+ malignancy 


Tis41 


++- 


B726P 


+ mflli0riancv 


Tis42 






9 


Tis43 






7 


Tis47 


+++ 


Mamma 0^1 ohin + 
Gaba 


? 


D487 


— 


- 


Normal 


D244 






NoTTnal 


D476 






TsFormal 

J. > yjJiJLi.x.cix 


D65 






Normal 


TisSl 




Mammafflobin + 
Gaba 


inadeauate inflammatorv 

XXXUVLVM U'CVWj XXXXXMJIXAXXXC4i<VX V 


Tis33 






+ ma1i0nanrv 

■ xxxcixx^xxciixw y 


Tis35 


— + 


Gaba 


+ malioTianpv 
' ixicxxxgxicuiv/ y 


Tis36 


++- 


Gaba 


-h malioTianrv 

( xixaii^iJLCiiiwy 


Tis37 


-h-H- 


Mammafflobin + 
Gaba 


H" maliffnancv 

' xxxcux^xxcuiw y 


Tis39 


-H-. 


Ambiguous 


inadequate 


Tis44 


+" 


Ambiguous 


? 


Tis46 


+- 


Ambiguous 


? 


Dl 51 -Normal 








D214-Normal 








DlOO-Nonnal 









Table 1 1 :Epithelial Cell Capture and Multiplex Real-Time PCR Assay on Bone 
MARROW Samples from Breast Cancer Patients 



Sample 
ID 


Tumor 
Status 


Tumor Type 


Bone Marrow* 


Blood 








Right 


Left 
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4 


TlbNO 


Invasive 


Mammaglobin 


- 


Nd 


5 


T2Nlb 


Invasive 

J-fUDUiaT 


- 


- 


Nd 


6 


T2Nlb 


Invasive 


- 


- 


Nd 


7 


T2N0 


Invasive 

UXJLL'lcil 


Mammaglobin 


GABA/B726P 


Nd 


8 


T3N1 


Invasive 
LfODuiar 


- 


GABA 


Nd 


9 


T2N0 


Invasive 

JL/UClal 


Mammaglobin 


Mammaglobin 


Nd 


10 


TlcNO 


Invasive 

LyUClal 


- 


- 


Nd 


11 


T2Nlb 


Invasive 


B726P 


- 


Nd 






invasive 
Ductal 






D/Zor 




1 icrNo 


Invasive 
Ductal 


(jrAJbSA 


Mammaglobin 




14 


TlbNO 


Colloid 






_ 


15 


T2N0 


Colloid, 
ductal 




B726P 




16 


TlcNO 


DCIS, 
Invasive 
uuciai 


GABA 




Mamma- 
globin/B726P 


17 


TlcNO 


Invasive 
jjuciax 


- 


Nd 


- 


18 


TlcNlb,c 


Invasive 
i^ucxai 


- 


- 


- 


19 


TlbNO 


Tubular 

^^oTv./ iiiuina 


B305D 


Mammaglobin 


- 


20 


TlbNO 


Invasive 

JL/UCldl 


Mammaglobin 


- 


B726P 


21 


TlcNO 


Invasive 
jL/uciai 


- 


GABA 


- 


22 


TxNl 


JLUVdol Vv 

Ductal 








23 


TlcNO 


Invasive 
Ductal 


Mammaglobin 


Nd 




24 


T2,TlbN0 


Invasive 


GABA 


B726P 
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Ductal 








25 


TlhNO 


Ductal 


iVlalllllla^lU Olll 

/ 

B305D 






26 


TlcNO 


Invasive 
Ductal 


GABA 






2.1 


1 ICNO 


Invasive 
Ductal 




GABA 


B305D 


TO 
ZO 


OXTA 

/NO 


Invasive 
Ductal, 

LLiL/tiiCll 


A X 1 "L * 

Mammaglobm 






29 


TINO 


Invasive 
Ductal 


B305D/GABA 


B305D/GABA 


Nd 


30 


T2Nlb 


Invasive 

T Aniimr 


Nd 


B305D 


- 


31 


T2N0 


Invasive 

LJ U-C- lal 


- 


- 


- 


32 


TlcNO 


Invasive 

i^U U UlCll J VI liW lal 


- 


- 


- 


33 


TlcNO 


Invasive 

JUUUUlaX 


- 


- 


- 


34 


TlcNO 


Invasive 


- 


GABA 


- 


35 


T3N0 


Invasive 

JL'Uvull 


GABA 


- 


- 


36 


T2Nlb 


Invasive 
Lobular 


GABA 


- 


- 


37 


TlbNla 


Invasive 
Ductal 






Mammaglobin 


38 


TlaNO 


Invasive 
Ductal 


Mammaglobin 







*Gene identification by gel analysis 
Nd: not determined 



From the foregoing it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
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various modifications may be made without deviating fi-om the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims. 
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